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BASF NAE 678/03 1/357 .... 

JCURec'dPCWTO 0 8 JUL ZOOS 

sequence: listing • - • 

<110> BASF AG 

<120> Method for the genetic modification of organisms of the Blakesiea genus, 
corresponding organisms and use of the same 

<130> ? 

<160> 80 

<170> Patentln version 3.2 

<210> 1 

<211> 2160 

<212> DNA 

<213> Artificial 

<220> 

<223> Promoter 
<400> 1 

ctttcgacac tgaaatacgt cgagcctgct ccgcttggaa gcggcgagga gcctcgtcct 60 

gtcacaacta ccaacatgga gtacgataag ggccagttcc gccagctcat taagagccag 120 

ttcatgggcg ttggcatgat ggccgtcatg catctgtact tcaagtacac caacgctctt 180 

ctgatccagt cgatcatccg ctgaaggcgc tttcgaatct ggttaagatc cacgtcttcg 240 

ggaagccagc gactggtgac ctccagcgtc cctttaaggc tgccaacagc tttctcagcc 300 

agggccagcc caagaccgac aaggcctccc tccagaacgc cgagaagaac tggaggggtg 360 

gtgtcaagga ggagtaagct ccttattgaa gtcggaggac ggagcggtgt caagaggata 420 

ttcttcgact ctgtattata gataagatga tgaggaattg gaggtagcat agcttcattt 480 

ggatttgctt tccaggctga gactctagct tggagcatag agggtccttt ggctttcaat 540 

attctcaagt atctcgagtt tgaacttatt ccctgtgaac cttttattca ccaatgagca 600 
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1 1 gcaa~ gaa catgaatctg aggactgcaa tcgccatgag gttttcgaaa tacatcccca 660 

tgtcgaaggc rtggggcacc tgcgttggtt caatttagaa cgtggcacta ttgatcatcc 720 

gatagctctg caaagggcgt tgcacaatgc aagtcaaacg ttgctagcag ttccaggtgg "780 

aatgttatga ugagcattgt attaaatcag gagatatagc atgatctcta gttagctcac 840 

cacaaaagtc agacggcgta accaaaagtc acacaacaca agctgtaagg atttcggcac 900 

ggctacggaa gacggagaag ccaccttcag tggactcgag taccatttaa ttctatttgt 960 

gtttgarcga gacctaatac agcccctaca acgaccatca aagtcgtata gctaccagtg 1020 

aggaagtgga ctcaaatcga cttcagcaac atctcctgga taaactttaa gcctaaacta 1080 

tacagaataa gataggtgga gagcttatac cgagctccca aatctgtcca gatcatggtt 1140 

gaccggtgcc tggatcttcc tatagaatca tccttattcg ttgacctagc tgattctgga 1200 

gtgacccaga gggtcatgac ttgagcctaa aatccgccgc ctccaccatt tgtagaaaaa 1260 

tgtgacgaac tcgtgagctc tgtacagtga ccggtgactc tttctggcat gcggagagac 1320 

ggacggacgc agagagaagg gctgagtaat aagccactgg ccagacagct ctggcggctc 1380 

tgaggtgcag tggatgatta ttaatccggg accggccgcc cctccgcccc gaagtggaaa 1440 

ggctggtgtg cccctcgttg accaagaatc tattgcatca tcggagaata tggagcttca 1500 

tcgaatcacc ggcagtaagc gaaggagaat gtgaagccag gggtgtatag ccgtcggcga 1560 

aatagcatgc cattaaccta ggtacagaag tccaattgct tccgatctgg taaaagattc 1620 

acgagatagt accttctccg aagtaggtag agcgagtacc cggcgcgtaa gctccctaat 1680 

tggcccatcc ggcatctgta gggcgtccaa atatcgtgcc tctcctgctt tgcccggtgt 1740 

atgaaaccgg aaaggccgct caggagctgg ccagcggcgc agaccgggaa cacaagctgg 1800 

cagtcgaccc atccggtgct ctgcactcga cctgctgagg tccctcagtc cctggtaggc 1860 
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agctttcccc cgtcrgtccg cccggtgtgt cggcggggtt gacaaggrcg zzgcqtcact 1920 

ccaacatttg ttgccatatt ttcctgctct ccccaccaqc tgctcttttc ttttctcttt 1980 

cttttcccat cttcagtata ttcatcttcc catccaagaa cctttatttc ccctaagtaa 2040 

gtactttgct acatccatac tccatccttc ccatccctta ttcctttgaa cctttcagtt 2100 

cgagctttcc cacttcatcg cagcttgact aacagctacc ccgcttgagc agacatcacc 2160 

<210> 2 

<211> 774 

<212> DNA 

<213> Artificial 

<220> 

<223> Terminator 



<220> 

<221> misc_feature 

<222> (267) . . (267) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (475) . . (475) 

<223> n is a, c, .g, or t 

<220> 

<221> misc_f eature 

<222> (566) . . (566) 

<223> n is a, c, g, or t 

<400> 2 

cgatccactt aacgttactg aaatcatcaa acagcttgac gaatctggat ataagatcgt 60 

tggtgtcgat gtcagctccg gagttgagac aaatggtgtt caggatctcg ataagatacg 120 

ttcatttgtc caagcagcaa agagtgcctt ctagtgattt aatagctcca tgtcaacaag 180 
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aataaaacgc gttttcgggt tt 3 octette cagatacagc teatctgeaa tccat taatc 240 

cattgactgc aacctagtaa cgccttncag gctccggcga agacaagaat agcttaccac 300 

agctattttc atttteggga gacgagatca agcagatcaa eggtegtcaa gagacctacg 360 

agactgagga atccgctctt ggctccacgc gactatatat . ttgtctctaa ttgtactttc 420 

acatgctcct cttctttact ctgatagctt gactatgaaa attccgtcac cagcncctgg 480 

gttcgcaaag ataattgeat gtttcttcct tgaactctca agectacagg acacacattc 540 

ategtaggta taaacctcga aatcanttcc tactaagatg gtatacaata gtaaccatgc 600 

atggttgcct agtgaatgct ccgtaacacc caatacgccg gecgaaaett ttttacaact 660 

ctcctatgag tcgtttaccc agaatgeaca ggtacacttg tttagaggta atccttcttt 720 

ctagctagaa gtcctcgtgt actgtgtaag cgcccactcc acatctccac tcga 774 

<210> 3 

<211> 15739 

<212> DNA 

<213> Artificial 

<220> 

<223> Vector 
<220> 

<221> misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc^feature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 



<220> 
<221> 



misc feature 
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<222> (3770) . . (3770) 
<223> n is a, c, g, or t 

<400> 3 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg -780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 
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gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgatict 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 13S0 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 
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gaat ucagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 2460 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 
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gaaaattccg tcaccegcnc ctgggttcgc aaagataatt gcatgtttct tccrtgaacr 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan rtcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttggc gtaatcatgg tcatagctgt 4020 

ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 4080 

agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 4140 

tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 4200 

cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag 4260 

ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga 4320 

cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa 4380 

ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat 4440 

caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg gtcatagccc 4500 

ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg 4560 

aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag 4620 

agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagt 4 680 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 4740 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 4800 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 4860 

atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 4920 
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cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 4SSC 

atatcgcgct gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg 5040 

ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc cgtcgctgtc 5100 

gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc 5160 

agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag 5220 

cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc 5280 

gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat 5340 

agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg aagcggtcag 5400 

cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc 54 60 

ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca 5520 

tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc 5580 

gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat 5640 

catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg 5700 

cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc attgcatcag 5760 

ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc tgccccggca 5820 

cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc 5880 

aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc agttcattca 5940 

gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga 6000 

acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg aatagcctct 6060 

ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc 6120 

cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag 6180 
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gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatitgcta gctgatagtg 6240 

accttaggcg acttctgaac gcgcaataat ggtttctgac gtatgtgctt agcicattaa 6300 

actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg 6360 

taaaacggct tgtcccgcgt catcggcggg ggtcataacg . tgactccctt aattctccgc 6420 

tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata 64S0 

ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc 6540 

gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga 6600 

tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc 6660 

gcccaqcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata 6720 

gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcacc'ggcat 6780 
aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat . 6840 

gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt 6900 

ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg 6960 

acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa cgaggtcggc 7020 

gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca gccggcgctt 7080 

tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggcg 7140 

gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt tctgatcggg 7200 

aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg catccatgcc 7260 

ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg cttcctctgc 7320 

gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacttcact 7380 

gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc 7440 
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gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt cgacgaagcc 7500 

ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt: ggcgaaaagg 7560 

aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg 7620 

ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc 7680 

accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc 7740 

aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc 7800 

tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7860 

cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7920 

gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7980 

gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 8040 

gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 8100 

ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc 8160 

gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc atcctttttc 8220 

gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg tatccaacgg 8280 

cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc cttcttcact 8340 

gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggct 8400 

accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag 84 60 

ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag 8520 

gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca gggctacaaa 8580 

atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg 8640 

ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc 8700 
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ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc 8760 

aaggtcatga tgggcgtggr ccgcccgagg gcagagccat gactttttta gccgctaaaa SS20 

cggccggggo gtgcgcgtga ttgccaagca cgtccccatg cgctccatca agaagagcga 8880 

cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga 8940 

cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc cctgcaaacg 9000 

cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt tgtggatacc 9060 

tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact tgaggggccg 9120 

actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg gcgacgtgga 9180 

gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc ccacagatga 9240 

tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc gcgactactg 9300 

acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga tgaggggcgc 9360 

acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc aagggtttcc 9420 

gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca atatttataa 9480 

accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg aaggggggtg 9540 

cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc ccaggggctg 9600 

cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt ccttgccatt 9660 

gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc cggaagcatt 9720 

gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag tgagggcggc 9780 

ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga cttcatggcg 9840 

gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc cgtgctcgtg 9900 

ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt ataccgaggt 9960 
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atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat tiaaaaagct 10020 

accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat attgacaata 10080 

ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga tttcaggggg 10140 

caaggcaTiag gcagcgcgct tatcaatata tctatagaat gggcaaagca taaaaacttg 10200 

catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt ctatcataat 10260 

tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc gatgactttg 10320 

tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg tgccaggtgc 10380 

tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct gattacgtgc 10440 

agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca tatcaccacg 10500 

tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg ttcaccgaat 10560 

acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca gcgctggcgc 10620 

gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat gacgtcactg 10680 

cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga cgtaaaatcg 10740 

tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca ttcatggcca 10800 

tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac tgcagttgcc 10860 

atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt ttgccgttac 10920 

gcaccacccc gtcagtagct gaacaggagg ga'cagctgat agacacagaa gccactggag 10980 

cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc cataattgtg 11040 

gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac aactttgaaa 11100 

aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg gagttcgtct 11160 

tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa ggaaataata 11220 
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aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat accgctgcgt 11280 

aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag aaaatgaaaa 11340 

cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg tggaacggga 11400 

aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc tgcactttga 11460 

acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc tttgctcgga 11520 

agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg agtgcatcag 11580 

gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag acagccgctt 11640 

agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg aaaactggga 11700 

agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga cggaaaagcc 11760 

cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct ttgtgaaaga 11820 

tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca agtggtatga 11880 

cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt atgtcgagct 11940 

attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt atattttact 12000 

ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag caggagcgca 12060 

ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc aagtatttgg 12120 

gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac gagaaggacg 12180 

gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg gacaccaagg 12240 

caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc ggggcaatcc 12300 

cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa gaactgatcg 12360 

acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc atgcgtgcgc 12420 

cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc aagatcgagc 12480 
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gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc gtggagcgtt 12540 

cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccaic gacacgcgag 12600 

gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa caggtcagcg 12660 

aggccaagca ggccgcgttig ctgaaacaca cgaagcagca gatcaa'ggaa argcagcttt 12720 

ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac gacacggccc 12780 

gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg caaaacaagg 12840 

tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag ctgcgggccg 12900 

acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc cctatcggcg 12960 

agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg atcaatggcc 13020 

ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg atgggcttca 13080 

cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc cgcgtcctgg 13140 

accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc gtcgtgctgt 13200 

ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg tcgccgacgg 13260 

cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc aagctggaaa 13320 

ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc gagcaggtcg 13380 

gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg gtcaatgatg 13440 

acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg ggttcagcag 13500 

ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact tgcttcgctc 13560 

agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag gattaaaatt 13620 

gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc aggatttccg 13680 

cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg tttacgagca 13740 
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cgaggagaaa aagcccargg aggcgttcgc tgaacggttg cgagatgccg tggcattcgg 13800 

cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg acggccccaa 13S60 

ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc gaggccgagg 13920 

ggrcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga tgarcgtccg 13980 

acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac ttaatatttc 14040 

gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg tcgcggcgac 14100 

ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc taggtagccc 14 160 

gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg cgctgttggt 14220 

gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg cgggggcggt 14280 

ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc ctctgctcac 14340 

ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag ctttagtgtt -14400 

tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt ggctcggcct 144 60 

gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac tcgaacctac 14520 

agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc cggggatgca 14580 

tcaggccgac ■ agtcggaact tcgggtcccc gacctgtacc attcggtgag caatggatag 14640 

gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc ttcctcagcg 14700 

gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca gcctgtcacg 14760 

gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg agatgatatt 14820 

tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct ccgcgagatc 14880 

atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc ggtaacatga 14940 

gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact gatgggctgc 15000 
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ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg ctggctggtg 15060 

gcaggatata ttgrggtgta aacaaattga cgcttagaca acttaataac acattgcgga I5I20 

cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa cagctgattg 15180 

cccttcaccg cctggccctg agagagttgc agcaagcggt . ccacgctggt ttgccccagc 15240 

aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat aaatcaaaag 15300 

aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 15360 

acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 15420 

aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc 15480 

ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 15540 

aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg ggaagggcga 15600 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 15660 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 15720 

ttcgagctcg gtacccggg 15739 
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<221> misc_feature 
<222> (318) . . (318) 
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<220> 

<221> misc_f eature 

<222> (526) . . (526) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (8946) . . (8946) 
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<220> 

<221> mi sc_f eature 

<222> (10028) . . (10028) 

<223> n is a, c, g, or t 

<400> 4 

agcttgcatg cctgcaggtc gagtggagat gtggagtggg cgcttacaca gtacacgagg 60 

acttctagct agaaagaagg attacctcta aacaagtgta cctgtgcatt ctgggtaaac 120 

gactcatagg agagttgtaa aaaagtttcg gccggcgtat tgggtgttac ggagcattca 180 

ctaggcaacc atgcatggtt actattgtat accatcttag taggaantga tttcgaggtt 240 

tatacctacg atgaatgtgt gtcctgtagg cttgagagtt caaggaagaa acatgcaatt 300 

atctttgcga acccaggngc tggtgacgga attttcatag tcaagctatc agagtaaaga 360 

agaggagcat gtcaaagtac aattagagac aaatatatag tcgcgtggag ccaagagcgg 420 

attcctcagt ctcgtaggtc tcttgacgac cgttgatctg cttgatctcg tctcccgaaa 480 

atgaaaatag ctctgctaag ctattcttct cttcgccgga gcctgnaagg cgttactagg 540 

ttgcagtcaa tgcattaatg cattgcagat gagctgtatc tggaagaggt aaacccgaaa 600 

acgcgtttta ttcttgttga catggagcta ttaaatcact agaaggcact ctttgctgct 660 
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tggacaaatg aacgtatctt atcgagatcc tgaacaccat ttgtctcaac tccggagctg 720 

acatcgacac caacgatctt atatccagat tcgtcaagct gtttgatgat ttcagtaacc 780 

ttaagtgga.t cgatcccgcg gtcggcatct actctattcc tttgccctcg gacgagtgct 840 

ggggcgncgg tttccactat cggcgagtac ttctacacag ccatcggtcc agacggccgc 900 

gcttctgcgg gcgatttgtg tacgcccgac agtcccggct ccggatcgga cgattgcgtc 960 

gcatcgaccc tgcgcccaag ctgcatcatc gaaattgccg tcaaccaagc tctgatagag 1020 

ttggtcaaga ccaatgcgga gcatatacgc ccggagccgc ggcgatcctg caagctccgg 1080 

atgcctccgc tcgaagtagc gcgtctgctg ctccatacaa gccaaccacg gcctccagaa 1140 

gaagatgttg gcgacctcgt attgggaa.tc cccgaacatc gcctcgctcc agtcaatgac 1200 

cgctgttatg cggccattgt ccgtcaggac attgttggag ccgaaatccg cgtgcacgag 1260 

gtgccggact tcggggcagt cctcggccca aagcatcagc tcatcgagag cctgcgcgac 1320 

ggacgcactg acggtgtcgt ccatcacagt ttgccagtga tacacatggg gatcagcaat 1380 

cgcgcatatg aaatcacgcc atgtagtgta ttgaccgatt ccttgcggtc cgaatgggcc 1440 

gaacccgctc gtctggctaa gatcggccgc agcgatcgca tccatggcct ccgcgaccgg 1500 

ctgcagaaca gcgggcagtt cggtttcagg caggtcttgc aacgtgacac cctgtgcacg 1560 

gcgggagatg caataggtca ggctctcgct gaattcccca atgtcaagca cttccggaat 1620 

cgggagcgcg gccgatgcaa agtgccgata aacataacga tctttgtaga aaccatcggc 1680 

gcagctattt acccgcagga catatccacg ccctcctaca tcgaagctga aagcacgaga 1740 

ttcttcgccc tccgagagct gcatcaggtc ggagacgctg tcgaactttt cgatcagaaa 1800 

cttctcgaca gacgtcgcgg tgagttcagg catggtgatg tctgctcaag cggggtagct 1860 

gttagtcaag ctgcgatgaa gtgggaaagc tcgaactgaa aggttcaaag gaataaggga 1920 
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tgggaaggat ggagtatgga tgtagcaaag tacttactta ggggaaataa aggttcttgg IS80 

atgggaagat gaatatactg aagatgggaa aagaaagaga aaagaaaaga gcagctggtg 2040 

gggagagcag gaaaatatgg caacaaatgt tggactgacg caacgacctt gtcaaccccg 2100 

ccgacacacc gggcggacag acggggcaaa gctgcctacc agggactgag ggacctcagc 2160 

aggtcgagtg cagagcaccg gatgggtcga ctgccagctt gtgttcccgg tctgcgccgc 2220 

tggccagctc ctgagcggcc tttccggttt catacaccgg gcaaagcagg agaggcacga 2280 

tatttggacg ccctacagat gccggatggg ccaattaggg agcttacgcg ccgggtactc 2340 

gctctaccta cttcggagaa ggtactatct cgtgaatctt ttaccagatc ggaagcaatt 2400 

ggacttctgt acctaggtta atggcatgct atttcgccga cggctataca cccctggctt 2460 

cacattctcc ttcgcttact gccggtgatt cgatgaagct ccatattctc cgatgatgca 2520 

atagattctt ggtcaacgag gggcacacca gcctttccac ttcggggcgg aggggcggcc 2580 

ggtcccggat taataatcat ccactgcacc tcagagccgc cagagctgtc tggccagtgg 2640 

cttattactc agcccttctc tctgcgtccg tccgtctctc cgcatgccag aaagagtcac 2700 

cggtcactgt acagagctca cgagttcgtc acatttttct acaaatggtg gaggcggcgg 2760 

attttaggct caagtcatga ccctctgggt cactccagaa tcagctaggt caacgaataa 2820 

ggatgattct ataggaagat ccaggcaccg gtcaaccatg atctggacag atttgggagc 2880 

tcggtataag ctctccacct atcttattct gtatagttta ggcttaaagt ttatccagga 2940 

gatgttgctg aagtcgattt gagtccactt cctcactggt agctatacga ctttgatggt 3000 

cgttgtaggg gctgtattag gtctcgatca aacacaaata gaattaaatg gtactcgagt 3060 

ccactgaagg tggcttctcc gtcttccgta gccgtgccga aatccttaca gcttgtgttg 3120 

tgtgactttt ggttacgccg tctgactttt gtggtgagct aactagagat catgctatat 3180 
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ctccigatut aatacaatgc tcatcataac attccacctg gaactgctag caacgtttga 3240 

cttgcattgt gcaacgccct ttgcagagct atcggatgat caatagtccc accttctaaa 3300 

ttcaaccaac gcaggtgccc caagccttcg acarccggat gtarttcgaa aacctcatgg 3360 

cgattgcagt cctcagattc atgttcattc caatgctcat tggtgaataa aaggttcaca 3420 

gggaataagt tcaaactcga gatacttgag aatattgaaa gccaaaggac cctctatgct 3480 

ccaagctaga gtctcagcct ggaaagcaaa tccaaatgaa gctatgctac ctccaattcc 3540 

tcatcatctt atctataata cagagtcgaa gaatatcctc ttgacaccgc tccgtcctcc 3600 

gacttcaata aggagcttac tcctccttga caccacccct ccagttcttc tcggcgttct 3660 

ggagggaggc cttgtcggtc ttgggctggc cctggctgag aaagctgttg gcagccttaa 3720 

agggacgctg gaggtcacca gtcgctggct tcccgaagac gtggatctta accagattcg 3780 

aaagcgcctt cagcggatga tcgactggat cagaagagcg ttggtgtact tgaagtacag 3840 

atgcatgacg gccatcatgc caacgcccat gaactggctc ttaatgagct ggcggaactg 3900 

gcccttatcg tactccatgt tggtagttgt gacaggacga ggctcctcgc cgcttccaag 3960 

cggagcaggc tcgacgtatt tcagtgtcga aagatctgat caagagacag gatgaggatc 4020 

gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccg'ctt gggtggagag 4080 

gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg 4140 

gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa 4200 

tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc 4260 

agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc 4320 

ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga 4380 

tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa 4440 
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acatcgcatc gagcgaccac gtactcggat ggaagcccct cttgtcgarc aggatgarct 4500 

ggacgaagag catcaggggc tcgcgccagc cgaactgttc cccaggctca aggcgcgcat 4560 

gcccgacggc gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt 4620 

ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta 4680 

tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga 4740" 

ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg 4800 

ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga ccaagcgacg 4860 

cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag gttgggcttc 4920 

ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct catgctggag 4980 

ttcttcgccc accccgggct cgatcccctc gcgagttggt tcagctgctg cctgaggctg 5040 

gacgacctcg cggagttcta ccggcagtgc aaatccgtcg gcatccagga aaccagcagc 5100 

ggctatccgc gcatccatgc ccccgaactg caggagtggg gaggcacgat ggccgctttg 5160 

gtccggatct ttgtgaagga accttacttc tgtggtgtga cataattgga caaactacct 5220 

acagagattt aaagctctaa ggtaaatata aaatttttaa gtgtataatg tgttaaacta 5280 

ctgattctaa ttgtttgtgt attttagatt ccaacctatg gaactgatga atgggagcag 5340 

tggtggaatg cctttaatga ggaaaacctg ttttgctcag aagaaatgcc atctagtgat 5400 

gatgaggcta ctgctgactc tcaacattct actcctccaa aaaagaagag aaaggtagaa 5460 

gaccccaagg actttccttc agaattgcta agttttttga gtcatgctgt gtttagtaat 5520 

agaactcttg cttgctttgc tatttacacc acaaaggaaa aagctgcact gctatacaag 5580 

aaaattatgg aaaaatattc tgtaaccttt ataagtaggc ataacagtta taatcataac 5640 

atactgtttt ttcttactcc acacaggcat agagtgtctg ctattaataa ctatgctcaa 5700 
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aaattgtgta cctttagctt tttaatttgt aaaggggtta ataacgaata tttgatgtat 5760 

agtgccttga ctagagatca taatcagcca taccacattt gtagaggttt tacttgcttt 5S20 

aaaaaacctc ccacacctcc ccctgaacct gaaacataaa atgaatgcaa ttgt tetter 5SS0 

taacttgttt attgeagett ataatggtta caaataaagc aatagcatca caaatttcac 5940 

aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 6000 

ttatcatgtc tggatctgac gggtgcgcat gategtgetc ctgtcgttga ggacccggct 6060 

aggctggegg ggttgcctta ctggttagca gaatgaatca ccgatacgcg agegaaegtg 6120 

aagcgactgc tgctgcaaaa cgtctgcgac ctgagcaaca acatgaatgg tcttcggttt 6180 

ccgtgtttcg taaagtctgg aaacgcggaa gtcagcgctc ttccgcttcc tcgct'cactg 6240 

actcgctgcg cteggtegtt cggctgcggc gageggtate agctcactca aaggcggtaa 6300 

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggecage 6360 

aaaaggccag caaaaggeca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 6420 

gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 6480 

gacaggacta taaagatacc aggegtttec ccctggaagc tccctcgtgc gctctcctgt 6540 

tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ecttegggaa gcgtggcgct 6600 

ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 6660 

ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actategtet 6720 

tgagtccaac ceggtaagae acgacttatc gccactggca gcagccactg gtaacaggat 6780 

tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc etaactaegg 6840 

ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ectteggaaa 6900 
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aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 6960 

ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 7020 

tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 7080 

atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 7140 

aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 7200 

ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac 7260 

tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg 7320 

ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcaga.ag 7380 

tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt 7440 

aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctgcag gcatcgtggt 7500 

gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt 7560 

tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt 7620 

cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct 7680 

tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt 7740 

ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaacac gggataatac 7800 

cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa 7860 

actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa 7920 

ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca 7980 

aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct 8040 

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga 8100 

atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc 8160 
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tgacgtctaa gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag 8220 

gccctttcgt crtcaagaat tcgcggccgc aattaaccct cactaaagga tccctatagt S280 

gagtcgtatt atgcggccgc gaattctcat gtttgaccgc ttatcatcga taagctctgc 3340 

tttttgttga cttccattgt tcattccacg gacaaaaaca . gagaaaggaa acgacagagg 8400 

ccaaaaagct cgctttcagc acctgtcgtt tcctttcttt tcagagggta ttttaaataa 8460 

aaacattaag ttatgacgaa gaagaacgga aacgccttaa accggaaaat tttcataaat 8520 

agcgaaaacc cgcgaggtcg ccgccccgta acaaggcgga tcgccggaaa ggacccgcaa 8580 

atgataataa ttatcaattg catactatcg acggcactgc tgccagataa caccaccggg 8640 

gaaacattcc atcatgatgg ccgtgcggac ataggaagcc agttcatcca tcgctttctt 8700 

gtctgctgcc atttgctttg tgacatccag cgccgcacat tcagcagcgt ttttcagcgc 8760 

g.ttttcgatc aacgtttcaa tgttggtatc aacaccaggt ttaactttga acttatcggc 8820 

actgacggtt accttgttct gcgctggctc atcacgcagg ataccaaggc tgatgttgta 8880 

gatattggtc accggctgag ggttttcgat tgccgctgcg tggatagcac catttgcgat 8940 

caggcngtcc ttgatgaatg acactccatt gcgaataagt tcgaaggaga cggtgtcacg 9000 

aatgcgctgg tccagctcgg tcgattgcct tttgtgcagc agaggtatca atctcaacgc 9060 

caaggctcat cgaagcgcaa tattgctgct caccaaaacg cgtattgacc aggtgttcaa 9120 

cggcaaattt ctgcccttct gatgtcagaa aggcaaagtg attttctttc tggtattcag 9180 

ttgctgtgtg tcggtttcag caaaaccaag ctcgcgcaat tcggctgtgc agatttagaa 9240 

ggcagatcac cagacagcaa cggccaacgg aaaacagcgc atacagaaca tccgtcgccg 9300 

cgccgacaac gtgataattt ttatgaccca tgatttattt ccttttagac gtgagcctgt 9360 

cgcacagcaa agccgccgaa agttcctcga agctagcttc agacgtgtct agatacgtct 9420 
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gcttrttgtt gacttccatt gttcattcca cggacaaaaa cagacaaagg aaacgacaga 9480 

ggccaaaaac ctcgctttca gcacctgtcg tttcctttct tttcagaggg tattttaaat 9540 

aaaaacatta agttatgacg aagaagaacg gaaacgcctt aaaccggaaa attttcataa 9600 

atagcgaaaa cccgcgaggt cgccgccccg taacaaggcg gatcgccgga aaggacccgc 9660 

aaatgataat aattatcaat tgcatactat cgacggcact gctgccagat aacaccaccg 9720 

gggaaacatt ccatcatgat ggccgtgcgg acataggaag ccagttcatc catcgctttc 9780 

ttgtctgctg ccatttgctt tgtgacatcc agcgccgcac attcagcagc gtttttcagc 9840 

gcgttttcga tcaacgtttc aatgttggta tcaacaccag gtttaacttt gaacttatcg 9900 

gcactgacgg ttaccttgtt ctgcgctggc tcatcacgca ggataccaag gctgatgttg 9960 

tagatattgg tcaccggctg agggttttcg attgccgctg cgtggatagc accatttgcg 10020 

atcaggcngt ccttgatgaa tgacactcca ttgcgaataa gttcgaagga gacggtgtca 10080 

cgaatgcgct ggtccagctc ggtcgattgc cttttgtgca gcagaggtat caatctcaac 10140 

gccaaggctc atcgaagcgc aatattgctg ctcaccaaaa cgcgtattga ccaggtgttc 10200 

aacggcaaat ttctgccctt ctgatgtcag aaaggcaaag tgattttctt tctggtattc 10260 

agttgctgtg tgtcggtttc agcaaaacca agctcgcgca attcggctgt gcagatttag 10320 

aaggcagatc accagacagc aacggccaac ggaaaacagc gcatacagaa catccgtcgc 10380 

cgcgccgaca acgtgataat ttttatgacc catgatttat ttccttttag acgtgagcct 10440 

gtcgcacagc aaagccgccg aaagttcctc gaccgatgcc cttgagagcc ttcaacccag 10500 

tcagctcctt ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg actgtcttct 10560 

ttatcatgca actcgtagga caggtgccgg cagcgctctg ggtcatttto ggcgaggacc 10620 

gctttcgctg gagcgcgacg atgatcggcc tgtcgcttgc ggtattcgga atcttgcacg 10680 
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ccctcgctca agccttcgtc actggtcccg ccaccaaacg tttcgcccag aagcaggcca 10740 

ttatcgccgg catggcggcc gacgcgctgg gctacgtctt gctggcgrtc gcgacgcgag 10800 

gctggatggc cttccccatt atgattcttc tcgcttccgg cggcatcggg atgcccgcgt 10860 

tgcaggccat gctgtccagg caggtagatg acgaccatca gggacagctt caaggatcgc 10920 

tcgcggctct taccagccta acttcgatca ttggaccgct gatcgtcacg gcgatttatg 10980 

ccgcctcggc gagcacatgg aacgggttgg catggattgt aggcgccgcc ctataccttg 11040 

tctgcctccc cgcgttgcgt cgcggtgcat ggagccgggc cacctcgacc tgaatggaag 11100 

ccggcggcac ctcgctaacg gattcaccac tccaagaatt ggagccaatc aattcttgcg 11160 

gagaactgtg aatgcgcaaa ccaacccttg gcagaacata tccatcgcgt ccgccatctc 11220 

cagcagccgc acgcggcgca tctcgggcag cgttgggtcc tgcagatccg gctgtggaat 11280 

gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 11340 

atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 11400 

agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 11460 

atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 11520 

tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 11580 

ggcttttttg gaggcctagg cttttgcaaa a 11611 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer 
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<400> 



5 



cgatgtagga gggcgtggat a 



21 



<210> 6 

<2I1> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 6 

gcttctgcgg gcgatttgtg t 21 

<210> 7 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 8 

agctcgacat actgttcttc c 21 



<400> 



7 



tgagaatatc accggaattg 



20 



<210> 9 
<211> 24 
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<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> S 

gtgaatggaa atcccatcgc tgtc 24 

<210> 10 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<210> 11 

<211> 1771 

<212> DNA 

<213> Haematococcus pluvialis 
<220> 

<221> CDS 

<222> (166) . . (1155) 

<400> 11 

ggcacgagct tgcacgcaag tcagcgcgcg caagtcaaca cctgccggtc cacagcctca 60 

aataataaag agctcaagcg tttgtgcgcc tcgacgtggc cagtctgcac tgccttgaac 120 

ccgcgagtct cccgccgcac tgactgccat agcacagcta gacga atg cag eta gca 177 



<400> 



10 



agtgggtact etaaaggeca tacc 



24 



Met Gin Leu Ala 



1 



gcg aca gta atg ttg gag cag ctt acc gga age get gag gca etc aag 



225 
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Ala 


Thr 


Val 


Met 


Leu 


Glu 


Gin 


Leu 


Thr 


Gly 


Ser 


Ala 


Glu 


Ala 


Leu 


Lys 




5 










10 










15 










20 




gag 


aag 


gag 


aag 


gag 


gtt 


gca 


ggc 


age 


tct 


gac 


gtg 


ttg 


cgz 


aca 


tgg 


273 


Glu 


Lys 


Glu 


Lys 


Glu 


Val 


Ala 


Gly 


Ser 


Ser 


Asp 


Val 


Leu 


Arg 


Thr 


Trp 












25 










30 










35 






gcg 


acc 


cag 


tac 


teg 


ctt 


ccg 


tea 


gaa 


gag 


tea 


gac 


gcg 


gcc 


cgc 


ccg 


321 


Ala 


Thr 


Gin 


Tyr 


Ser 


Leu 


Pro 


Ser 


Glu 


Glu 


Ser 


Asp 


Ala 


Ala 


Arg 


Pro 










40 










45 










50 








gga 


ctg 


aag 


_ _ ^_ 


gcc 


tac 


aag 


cca 


cca 


cct 


tec 


gac 


aca 


aag 


ggc 


ate 


369 


Gly 


Leu 


Lys 


Asn 


Ala 


Tyr 


Lys 


Pro 


Pro 


Pro 


Ser 


Asp 


Thr 


Lys 


Gly 


He 








55 










60 










65 










aca 


atg 


gcg 


eta 


cgt 


gtc 


ate 


ggc 


tec 


tgg 


gcc 


gca 


gtg 


ttc 


etc 


cac 


417 


Thr 


Met 


Ala 


Leu 


Arg 


Val 


He 


Gly 


Ser 


Trp 


Ala 


Ala 


Val 


Phe 


Leu 


His 






70 










75 










80 












gcc 


att 


ttt 


caa 


ate 


aag 


ctt 


ccg 


acc 


tec 


ttg 


gac 


cag 


ctg 


cac 


tgg 


465 


Ala 


He 


Rhe 


Gin 


He 


Lys 


Leu 


-Pro 


Thr 


Ser 


Leu 


Asp 


Gin 


Leu 


His 


Trp . 




85 










90 










95 










100 




ctg 


ccc 


gtg 


tea 


gat 


gcc 


aca 


get 


cag 


ctg 


gtt 


age 


ggc 


acg 


age 


age 


513 


Leu 


Pro 


Val 


Ser 


Asp 


Ala 


Thr 


Ala 


Gin 


Leu 


Val 


Ser 


Gly 


Thr 


Ser 


Ser 












105 










110 










115 






ctg 


etc 


gac 


ate 


gtc 


gta 


gta 


ttc 


ttt 


gtc 


ctg 


gag 


ttc 


ctg 


tac 


aca 


561 


Leu 


Leu 


Asp 


He 


Val 


Val 


Val 


Phe 


Phe 


Val 


Leu 


Glu 


Phe 


Leu 


Tyr 


Thr 










120 










125 










130 








ggc 


ctt 


ttt 


ate 


acc 


acg 


cat 


gat 


get 


atg 


cat 


ggc 


acc 


ate 


gcc 


atg 


609 


Gly 


Leu 


Phe 


He 


Thr 


Thr 


His 


Asp 


Ala 


Met 


His 


Gly 


Thr 


He 


Ala 


Met 








135 










140 










145 










aga 


aac 


agg 


cag 


ctt 


aat 


gac 


ttc 


ttg 


ggc 


aga 


gta 


tgc 


ate 


tec 


ttg 


657 


Arg 


Asn 


Arg 


Gin 


Leu 


Asn 


Asp 


Phe 


Leu 


Gly 


Arg 


Val 


Cys 


He 


Ser 


Leu 






150 










155 










160 












tac 


gcc 


tgg 


ttt 


gat 


tac 


aac 


atg 


ctg 


cac 


cgc 


aag 


cat 


tgg 


gag 


cac 


705 


Tyr 


Ala 


Trp 


Phe 


Asp 


Tyr 


Asn 


Met 


Leu 


His 


Arg 


Lys 


His 


Trp 


Glu 


His 





165 170 175 180 



BASF NAE 878/03 



31/357 



cac aac cac act ggc gag gtg ggc aag gac cct gac ttc cac acg gga 751 
His Asn His Thr Giy Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly 
185 190 195 



aac cct ggc att gtg ccc tgg ttt gcc age ttc atg tec age tac atg 
Asn Pro Gly lie Val Pro Trp Phe Ala Ser Phe .Met Ser Ser Tyr Met 
200 205 210 



801 



teg atg tgg cag ttt gcg cgc etc gca tgg tgg acg gtg gtc atg. cag 849 
Ser Met Trp Gin Phe Ala Arg Leu Ala Trp Trp Thr Val Val Met Gin 
215 220 225 



ctg ctg ggt 
Leu Leu Gly 
230 

ccc ate ctg 
Pro lie Leu 
245 

cac aag cct 
His Lys Pro 



gcg cca atg 
Ala Pro Met 



tec gcc ttc 
Ser Ala Phe 
250 

gag cct ggc 
Glu Pro Gly 
265 



gcg aac ctg 
Ala Asn Leu 
235 

cgc ttg ttc 
Arg Leu Phe 

gcc gcg tea 
Ala Ala Ser 



ctg gtg ttc 
Leu Val Phe 
240 

tac ttt ggc 
Tyr Phe Gly 
255 

ggc tct tea 
Gly Ser Ser 
270 



atg gcg gcc 
Met Ala Ala 



acg tac atg 
Thr Tyr Met 

cca gcc gtc 
Pro Ala Val 
275 



gcg 897 
Ala 

ccc 945 

Pro 

260 

atg 993 
Met 



aac tgg tgg aag teg cgc .act age cag gcg tec gac ctg gtc age ttt 
Asn Trp Trp Lys Ser Arg Thr Ser Gin Ala Ser Asp Leu Val Ser Phe 
280 285 290 



1041 



ctg ace tgc tac cac ttc gac ctg cac tgg gag cac cac cgc tgg ccc 1089 
Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro 
295 300 305 



ttc gcc ccc tgg tgg gag ctg ccc aac tgc cgc cgc ctg tct ggc cga 
Phe Ala Pro Trp Trp Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg 
310 315 320 



1137 



ggt ctg gtt cct gcc tag ctggacacac tgcagtgggc cctgctgcca 1185 

Gly Leu Val Pro Ala 

325- 



gctgggcatg caggttgtgg caggactggg tgaggtgaaa agetgeagge gctgctgccg 124 5 
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gacacgctgc atgggctacc ctgtgtagct gccgccacta ggggaggggg tttgcagctg 1305 

tcgagct Tigc cccatggatg aagctgtgta gtggtgcagg gagtacaccc acaggccaac 1365 

acccttccac gagatgtctt gcgtcgggag gagtgttggg cagtgtagat gctatgattg 1425 

tatcttaatg ctgaagcctt taggggagcg acacttagtg ctgggcaggc aacgccctgc 1485 

aaggtgcagg cacaagctag gctggacgag gactcggtgg caggcaggtg aagaggtgcg 1545 

ggagggtggt gccacaccca ctgggcaaga ccatgctgca atgctggcgg tgtggcagtg 1605 

agagctgcgt gattaactgg gctatggatt gtttgagcag tctcacttat tctttgatat 1665 

agatactggt caggcaggtc aggagagtga gtatgaacaa gttgagaggt ggtgcgctgc 1725 

ccctgcgctt atgaagctgt aacaataaag tggttcaaaa aaaaaa • 1771 



<210> 12 
<211> 329 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 12 

Met Gin Leu Ala Ala Thr Val Met Leu Glu Gin Leu Thr Gly Ser Ala 
15 10 15 



Glu Ala Leu Lys Glu Lys Glu Lys Glu Val Ala Gly Ser Ser Asp Val 
20 25 30 



Leu Arg Thr Trp Ala Thr Gin Tyr Ser Leu Pro Ser Glu Glu Ser Asp 
35 40 45 



Ala Ala Arg Pro Gly Leu Lys Asn Ala Tyr Lys 
50 55 



Pro Pro Pro Ser Asp 
60 
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Thr Lys Gly lie Thr Met Ala Leu Arg Val lie Gly Ser Trp Ala Ala 
65 70 75 SO 



Val Phe Leu His Ala lie Phe Gin lie Lys Leu Pro Thr Ser Leu Asp 
85 90 95 



Gin Leu His Trp Leu Pro Val Ser Asp Ala Thr Ala Gin Leu Val. Ser 
100 105 110 



Gly Thr Ser Ser Leu Leu Asp lie Val Val Val Phe Phe Val Leu Glu 
115 120 125 



Phe Leu Tyr Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly 
130 135 140 



Thr lie Ala Met Arg Asn Arg Gin Leu Asn Asp Phe Leu Gly Arg Val 
145 150 155 160 



Cys lie Ser Leu Tyr Ala Trp Phe Asp Tyr Asn Met Leu His Arg Lys 
165 170 175 



His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp 
180 185 190 



Phe His Arg Gly Asn Pro Gly lie Val Pro Trp Phe Ala Ser Phe Met 
195 200 205 



Ser Ser Tyr Met Ser Met Trp Gin Phe Ala Arg Leu Ala Trp Trp Thr 
210 215 220 
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Val Val Met Gin Leu Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe 

225 230 235 240 

Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 

245 250 255 



Thr Tyr Met Pro His Lys Pro Glu Pro Gly Ala Ala Ser Gly Ser Ser 
260 265 270 

Pro Ala Val Met Asn Trp Trp Lys Ser Arg Thr Ser Gin Ala Ser Asp 

275 280 285 

Leu Val Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His 
290 295 300 



His Arg Trp Pro Phe Ala Pro Trp Trp Glu Leu Pro Asn Cys Arg Arg 
305 310 315 320 

Leu Ser Gly Arg Gly Leu Val Pro Ala 
325 



<210> 13 
<211> 1662 
<212> DNA 

<213> Haematococcus pluvialis 
<220> 

<221> CDS 

<222> (168) . . (1130) 
<400> 13 

cggggcaact caagaaattc aacagctgca agcgcgcccc agcctcacag cgccaagtga 
gctatcgacg tggttgtgag cgctcgacgt ggtccactga cgggcctgtg agcctctgcg 
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ctccgtcctc tgccaaatct cgcgtcgggg cctgcctaag tcqaaga atg cac gtc 176 

Met His Val 
I 

gca teg gca eta atg gtc gag cag aaa ggc agt gag gca get get zee 224 
Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser 
5 10 15 

age cca gac gtc ttg aga gcg tgg gcg aca cag tat cac atg cca tec 272 
Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser 
20 25 30 35 

gag teg tea gac gca get cgt cct gcg eta aag cac gee tac aaa cct 320 
Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro 
40 45 50 

cca gca tct gac gee aag ggc ate acg atg gcg ctg acc ate att ggc 368 
Pro Ala Ser Asp Ala Lys Gly lie Thr Met Ala Leu Thr lie lie Gly 
55 60 65 

acc tgg acc gca gtg ttt tta cac gca ata ttt caa ate agg eta ccg 416 
Thr Trp Thr Ala Val Phe Leu His Ala lie Phe Gin lie Arg Leu Pro 
70 75 80 

aca tec atg gac cag ctt cac tgg ttg cct gtg tec gaa gee aca gee 464 
Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala 
85 90 95 

cag ctt ttg ggc gga age age age eta ctg cac ate get gca gtc ttc 512 
Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His lie Ala Ala Val Phe 
100 105 110 115 

att gta ctt gag ttc ctg tac act ggt eta ttc ate acc aca cat gac 560 
lie Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe lie Thr Thr His Asp 
120 125 130 

gca atg cat ggc acc ata get ttg agg cac agg cag etc aat gat etc 608 
Ala Met His Gly Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu 
135 140 145 



ctt ggc aac ate tgc ata tea ctg tac gee tgg ttt gac tac age atg 



656 
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Leu Gly Asn 
150 

etc cat; cq c 
Leu His Arg 
165 

aaa gac cct 
Lys Asp Pro 
180 

gec age ttc 
Ala Ser Phe 



gca tgg tgg 
Ala Trp Trp 

etc eta gtc 
Leu Leu Val 
230 

ttc tac ttc 
Phe Tyr Phe 
245 

gca ggc tct 
Ala Gly Ser 
260 

tct gat gtg 
Ser Asp Val 



gag cac cac 
Glu His His 



cgc cgc ctg 
Arg Arg Leu 
310 



He Cys He 

aag cac tgg 
Lys His Trp 

gac ttc cac 
Asp Phe His 
185 

atg tec age 
Met Ser Ser 
200 

gca gtg gtg 
Ala Val Val 
215 

ttc atg get 
Phe Met Ala 



ggc act tac 
Gly Thr Tyr 

cag gtg atg 
Gin Val Met 
265 

atg agt ttc 
Met Ser Phe 
280 

agg tgg ccc 
Arg Trp Pro 
295 

tec ggg cgt 
Ser Gly Arg 



Ser Leu Tyr 
155 

gag cac cac 
Glu His His 
170 

aag gga aat 
Lys Gly Asn 

tac atg tec 
Tyr Met Ser 

atg caa atg 
Met Gin Met 
220 

gca gec cca 
Ala Ala Pro 
235 

ctg cca cac 

Leu Pro His 
250 

gee tgg ttc 

Ala Trp Phe 

ctg aca tgc 
Leu Thr Cys 

ttt gec ccc 
Phe Ala Pro 
300 

ggc ctg gtg 
Gly Leu Val 
315 



Ala Trp Phe 

aac cat act 
Asn His Thr 
175 

ccc ggc ctt 
Pro Gly Leu 
190 

ctg tgg cag 
Leu Trp Gin 
205 

ctg ggg gcg 
Leu Gly Ala 

ate ttg tea 
He Leu Ser 

aag cct gag 
Lys Pro Glu 
255 

agg gee aag 
Arg Ala Lys 
270 

tac cac ttt 
Tyr His Phe 
285 

tgg tgg cag 
Trp Trp Gin 

cct gee ttg 
Pro Ala Leu 



Asp Tyr Ser 
160 

ggc gaa gtg 
Gly Glu Val 



gtc ccc tgg 
Val Pro Trp 



ttt gee egg 
Phe Ala Arg 
210 

ccc atg gca 
Pro Met Ala 
225 

gca ttc cgc 
Ala Phe Arg 
240 

cca ggc cct 
Pro Gly Pro 



aca agt gag 
Thr Ser Glu 

gac ctg cac 
Asp Leu His 
290 

ctg ccc cac 
Leu Pro His 
305 

gca tga 

Ala 

320 



Met 



ggg 704 
Gly 

ttc 752 

Phe 

195 

ctg 800 
Leu 

aat 848 
Asn 

etc 896 
Leu 

gca 944 
Ala 



gca 992 

Ala 

275 

tgg 1040 
Trp 

tgc 1088 
Cys 

1130 
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cctqgtccct ccgctqgtga cccagcgtct gcacaagagt gtcatgctac agcgtgctgc 

ggccagtggc agcgcactgc actctcagcc tgtatggggc taccgctgtg ccacrgagca 

ctgggcatgc cacrgagcac tgggcgtgct actgagcaat gggcgtgcta ctgaccaatg 

ggcgtgctac tgacaatggg cgtgctactg gggtctggca gtggctagga tggagtttga 

tgcattcagt agcggtggcc aacgtcatgt ggatggtgga agtgctgagg ggtttaggca 

gccggcattt gagagggcta agttataaat cgcatgctgc tcatgcgcac atatctgcac 

acagccaggg aaatcccttc gagagtgatt atgggacact tgtattggtt tcgtgctatt 

gttttattca gcagcagtac ttagtgaggg tgagagcagg gtggtgagag tggagtgagt 

gagtatgaac ctggtcagcg aggtgaacag cctgtaatga atgactctgt ct 

<210> 14 
<211> 320 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 14 

Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala 

1 5 10 15 

Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His 



Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala 



1190 
1250 
1310 
1370 
1430 
1490 
1550 
1610 
1662 



20 



25 



30 



35 



45 



Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie Thr Met Ala Leu Thr 
50 55 60 
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He He Gly Thr 
65 



Arg Leu Pro Thr 



Ala Thr Ala Gin 
100 



Ala Val Phe He 
115 



Thr His Asp Ala 
130 



Asn Asp Leu Leu 
145 



Tyr Ser Met Leu 



Glu Val Gly Lys 
180 



Pro Trp Phe Ala 
195 



Ala Arg Leu Ala 
210 



Trp Thr Ala Val 
70 



Ser Met Asp Gin 
85 



Leu Leu Gly Gly 



Val Leu Glu Phe 
120 



Met His Gly Thr 
135 



Gly Asn He Cys 
150 



His Arg Lys His 
165 



Asp Pro Asp Phe 



Ser Phe Met Ser 
200 



Trp Trp Ala Val 
215 



Phe Leu His Ala 
75 



Leu His Trp Leu 
90 



Ser Ser Ser Leu 
105 



Leu Tyr Thr Gly 



lie Ala Leu Arg 
140 



He Ser Leu Tyr 
155 



Trp Glu His His 
170 



His Lys Gly Asn 
185 



Ser Tyr Met Ser 



Val Met Gin Met 
220 



He Phe Gin He 
80 



Pro Val Ser Glu 
95 



Leu His He Ala 
110 



Leu Phe lie Thr. 
125 



His Arg Gin Leu 



Ala Trp Phe Asp 
160 



Asn His Thr Gly 
175 



Pro Gly Leu Val 
. 190 



Leu Trp Gin Phe 
205 



Leu Gly Ala Pro 
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Met Ala Asn Leu Leu Val ?he Met Ala Ala Ala Pro lie Leu Ser Ala 

225 230 235 240 

Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro 

245 250 255 



Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr 
260 265 270 



Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp 
275 280 285 

Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu 
290 295 300 

Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
305 310 315 320 

<210> 15 

<211> 729 

<212> DNA 

<213> Agrobacterium aurantiacum 
<220> 

<221> CDS 

<222> (i) . . (729) 

<400> 15 

atg age gca cat gec ctg ccc aag gca gat ctg acc gec acc age ctg 48 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
15 10 15 



ate gtc teg 
lie Val Ser 



ggc ggc 
Gly Gly 
20 



ate ate gee get tgg ctg gec ctg cat gtg cat 
lie lie Ala Ala Trp Leu Ala Leu His Val His 
25 30 



96 
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gcg ctg tog ttt ctg cac gca gcg gcg car ccc ate erg gcg ate gca 

Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro lie Leu Ala lie Ala 

35 40 45 

aat ttc ctg ggg ctg ace tgg ctg teg gtc gga ttg ttc ate ate gcg 

Asn Phe Leu Giy Leu Thr Trp Leu Ser Val Gly Leu Phe lie lie Ala 

50 55 60 

cat gac gcg atg cac ggg teg gtg gtg ccg ggg cgt ccg cgc gee aat 

His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn 

65 70 75 80 

gcg gcg atg ggc cag ctt gtc ctg tgg ctg tat gee gga ttt teg tgg 

Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp. 

85 90 95 

cgc aag atg ate gtc aag cac atg gee cat cac cgc cat gee gga ace ■ 

Arg Lys Met lie Val Lys His Met Ala His His Arg His Ala Gly Thr 

100 105 110 

gac gac gac ccc ,gat ttc gac cat ggc ggc ccg gtc cgc tgg tac gec 

Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 

115 120 125 

cgc ttc ate ggc ace tat ttc ggc tgg cgc gag ggg ctg ctg ctg ccc 

Arg Phe lie Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 

130 135 140 

gtc ate gtg acg gtc tat gcg ctg ate ctt ggg gat cgc tgg atg tac 

Val lie Val Thr Val Tyr Ala Leu lie Leu Gly Asp Arg Trp Met Tyr 

145 150 155 160 

gtg gtc ttc tgg ccg ctg ccg teg ate ctg gcg teg ate cag ctg ttc 

Val Val Phe Trp Pro Leu Pro Ser lie Leu Ala Ser lie Gin Leu Phe 

165 170 175 

gtg ttc ggc ace tgg ctg ccg cac cgc ccc ggc cac gac gcg ttc ccg 

Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 

180 185 190 



gac cgc cac aat gcg egg teg teg egg ate age gac ccc gtg teg ctg 
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Asp Arg His Asn Ala Arg Ser Ser Arg lie Ser Asp Fro Val Ser Leu 
195 200 205 

ctg acc tgc ttt cac ttt ggc ggt tat cat cac gaa cac cac ctg cac 

Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 

ccg acg gtg ccg tgg tgg cgc ctg ccc age acc cgc acc aag ggg gac 

Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 . 240 

acc gca tga 
Thr Ala 



<210> 16 

<211> 242 

<212> PRT 

<213> Agrobacterium aurant iacum 

<400> 16 



Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
1 5 10 15 



He Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 



Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro He Leu Ala He Ala 
35 40 45 



Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe He He Ala 
50 55 60 



His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn 
65 70 75 80 
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Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Giy ?he Ser Trp 
85 90 95 



Arg Lys Met lie Val Lys His Met Ala His His Arg His Ala Gly Thr 
100 105 110 



Asp Asp Asp- Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 
115 120 125 



Arg Phe lie Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro. 
130 135 140 



Val lie Val Thr Val Tyr Ala Leu lie Leu Gly Asp Arg Trp Met Tyr 
145 150 155 160 



Val Val Phe Trp Pro Leu Pro Ser lie Leu Ala Ser lie Gin Leu Phe 
165 170 175 



Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 
180 185 190 



Asp Arg His Asn Ala Arg Ser Ser Arg lie Ser Asp Pro Val Ser Leu 
195 200 205 



Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 



Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 240 
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<210> 17 

<2il> 1651 

<212> DNA 

<213> Alcaligenes sp. 



<220> 

<221> CDS 

<222> (99) . . (827) 

<400> 17 

ctgcaggccg ggcccggtgg ccaatggtcg caaccggcag gactggaaca ggacggcggg 60 

ccggtctagg ctgtcgccct acgcagcagg agtttcgg atg tec gga egg aag cct 116 

Met Ser Gly Arg Lys Pro 
1 5 

ggc aca act ggc gac acg ate gtc aat etc ggt ctg ace gee gcg ate 164 
Gly Thr Thr Gly Asp Thr lie Val Asn Leu Gly Leu Thr Ala Ala lie 
10 15 20 



ctg ctg tgc tgg ctg gtc ctg cac gec ttt acg eta tgg ttg eta gat 212 

Leu Leu Cys Trp Leu Val Leu His Ala Phe Thr Leu Trp Leu Leu Asp 

25 30 35 

gcg gec gcg cat ccg ctg ctt gec gtg ctg tgc ctg get ggg ctg ace 260 

Ala Ala Ala His Pro Leu Leu Ala Val Leu Cys Leu Ala Gly Leu Thr 

40 45 50 



tgg ctg teg gtc ggg ctg ttc ate ate gcg cat gac gca atg cac ggg 
Trp Leu Ser Val Gly Leu Phe lie lie Ala His Asp Ala Met His Gly 
55 60 65 70 



308 



tec gtg gtg ccg ggg egg ccg cgc gee aat gcg gcg ate ggg caa ctg 356 
Ser Val Val Pro Gly Arg Pro Arg Ala Asn Ala Ala lie Gly Gin Leu 
75 80 85 



gcg ctg tgg etc tat gcg ggg ttc teg tgg ccc aag ctg ate gec aag 



404 
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Ala Leu Trp Leu Tyr Ala Gly Phe Ser Trp Pro Lys Leu lie Ala Lys 
90 95 100 

cac atg acg cat cac egg cac gec ggc acc gac aac gat ccc gar ttc 452 
His Met Thr His His Arg His Ala Gly Thr Asp Asn A.sp Pro Asp Phe 
105 110 115 

ggt cac gga ggg ccc gtg cgc tgg tac ggc age ttc gtc tec acc tat 500 
Gly His Gly Gly Pro Val Arg Trp Tyr Gly Ser Phe Val Ser Thr Tyr 
120 125 130 

ttc ggc tgg cga gag gga ctg ctg eta ccg gtg ate gtc acc acc tat 548 
Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro Val lie Val Thr Thr Tyr 
135 140 145 150 

gcg ctg ate ctg ggc gat cgc tgg atg tat gtc ate ttc tgg ccg gtc 596 
Ala Leu lie Leu Gly Asp Arg Trp Met Tyr Val lie Phe Trp Pro Val 
155 160 165 

ccg gee gtt ctg gcg teg ate cag att ttc gtc ttc gga act tgg ctg 644 
Pro Ala Val Leu Ala Ser lie Gin lie Phe Val Phe Gly Thr Trp Leu 
170 175 180 

ccc cac cgc ccg gga cat gac gat ttt ccc gac egg cac aac gcg agg 692 
Pro His Arg Pro Gly His Asp Asp Phe Pro Asp Arg His Asn Ala Arg 
185 190 195 

teg acc ggc ate ggc gac ccg ttg tea eta ctg acc tgc ttc cat ttc 740 
Ser Thr Gly lie Gly Asp Pro Leu Ser Leu Leu Thr Cys Phe His Phe 
200 205 210 

ggc ggc tat cac cac gaa cat cac ctg cat ccg cat gtg ccg tgg tgg 788 
Gly Gly Tyr His His Glu His His Leu His Pro His Val Pro Trp Trp 
215 220 225 230 

cgc ctg cct cgt aca cgc aag acc gga ggc cgc gca tga cgcaattcct 837 
Arg Leu Pro Arg Thr Arg Lys Thr Gly Gly Arg Ala 
235 240 

cattgtcgtg gcgacagtcc tcgtgatgga gctgaccgcc tattccgtcc acegctggat - 897 



tatgeaegge cccctaggct ggggctggca caagtcccat cacgaagagc acgaccacgc 957 
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gttggagaag aacgacctct acggcgtcgt cttcgcggtg ctggcgacga tcctcttcac 1017 

cgtgggcgcc tattggtggc cggtgctgtg gtggatcccc ctgggcatga cggtctatgg 1077 

gttgatctat ttcatcctgc acgacgggct tgtgcatcaa cgctggccgt ttcggtatat 1137 

tccgcggcgg ggctatttcc gcaggctcta ccaagctcat cgcctgcacc acgcggtcga 1197 

ggggcgggac cactgcgtca gcttcggctt catctatgcc ccacccgtgg acaagctgaa 1257 

gcaggatctg aagcggtcgg gtgtcctgcg cccccaggac gagcgtccgt cgtgatctct 1317 

gatcccggcg tggccgcatg aaatccgacg tgctgctggc aggggccggc cttgccaacg 1377 

gactgatcgc gctggcgatc cgcaaggcgc ggcccgacct tcgcgtgctg ctgctggacc 1437 

gtgcggcggg cgcctcggac gggcatactt ggtcctgcca cgacaccgat ttggcgccgc 14 97 

actggctgga ccgcctgaag ccgatcaggc gtggcgactg gcccgatcag gaggtgcggt 1557 

tcccagacca ttcgcgaagg ctccgggccg gatatggctc gatcgacggg cgggggctga 1617 

tgcgtgcggt gacc 1631 



<210> 18 
<211> 242 
<212> PRT 

<213> Alcaligenes sp. 
<400> 18 

Met Ser Gly Arg Lys Pro Gly Thr Thr Gly Asp Thr lie Val Asn Leu 
15 10 15 



Gly Leu Thr Ala Ala lie Leu Leu Cys Trp Leu 
20 25 



Val Leu His Ala Phe 
30 
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Thr Leu Trp Leu 
35 



Cys Leu Ala Gly 
50 



His Asp Ala Met 
65 



Ala Ala lie Gly 



Pro Lys Leu lie 
100 



Asp Asn Asp Pro 
115 



Ser Phe Val Ser 
130 



Val He Val Thr 
145 



Val He Phe Trp 



Val Phe Gly Thr 
180 



Leu Asp Ala Ala 
4 0 



Leu Thr Trp Leu 
55 



His Gly Ser Val 
70 



Gin Leu Ala Leu 
85 



Ala Lys His Met 



Asp Phe Gly His 
120 



Thr Tyr Phe Gly 
135 



Thr Tyr Ala Leu 
150 



Pro Val Pro Ala 
165 



Trp Leu Pro His 



Ala His Pro Leu 



Ser Val Gly Leu 
60 



Val Pro Gly Arg 
75 



Trp Leu Tyr Ala 
90 



Thr His His Arg 
105 



Gly Gly Pro Val 



Trp Arg Glu Gly 
140 



He Leu Gly Asp 
155 



Val Leu Ala Ser 
170 



Arg Pro Gly His 
185 



Leu Ala Val Leu 
45 



Phe lie He Ala 



Pro Arg Ala Asn 
80 



Gly Phe Ser Trp 
95 



His Ala Gly Thr 
110 



Arg Trp Tyr Gly 
125 



Leu Leu Leu Pro 



Arg Trp Met Tyr 
160 



He Gin lie Phe 
175 



Asp Asp Phe Pro 
190 



Asp Arg His Asn Ala Arg Ser Thr Gly He Gly Asp Pro Leu Ser Leu 
195 200 205 
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Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 



Pro His Val Pro Trp Trp Arg Leu Pro Arg Thr Arg Lys Thr Gly Gly 
225 230 235 240 



Arg Ala 



<210> 19 

<211> 729 

<212> DNA 

<213> Paracoccus marcusii 



<220> 

<221> CDS ' 
<222> (1) . . (729) 

<400> 19 

atg age gca cat gec ctg ccc aag gca gat ctg acc gec aca age ctg 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
15 10 15 

ate gtc teg ggc ggc ate ate gee gca tgg ctg gee ctg cat gtg cat 
He Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 

gcg ctg tgg ttt ctg gac gcg gcg gec cat ccc ate ctg gcg gtc gcg 
Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro He Leu Ala Val Ala 
35 40 45 

aat ttc ctg ggg ctg acc tgg ctg teg gtc gga ttg ttc ate ate gcg 
Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe He He Ala 
50 55 60 



cat gac gcg atg cac ggg teg gtc gtg ccg ggg cgt ccg cgc gee aat 
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His Asp Ala Met His Gly Ser Vai Val Pro Gly Arg Pro Arg Ala Asn 
65 70 75 80 



gcg gcg azg 
Ala Ala Met 



cgc aag atg 
Arg Lys Met 

gac gac gac 
Asp Asp Asp 
115 

cgc ttc ate 
Arg Phe lie 
130 

gtc ate gtg 
Val He Val 
145 



ggc cag ctt 
Gly Gin Leu 
85 

ate gtc aag 
He Val Lys 
100 

cca gat ttc 
Pro Asp Phe 

ggc acc tat 
Gly Thr Tyr 

acg gtc tat 
Thr Val Tyr 
150 



gtc ctg tgg 
Val Leu Trp 



cac atg gec 
His Met Ala 
105 

gac cat ggc 
Asp His Gly 
120 

ttc ggc tgg 
Phe Gly Trp 
135 

gcg ctg ate 
Ala .Leu He 



ctg tat gee 
Leu Tyr Ala 
90 

cat cac cgc 
His His Arg 

ggc ccg gtc 
Gly Pro Val 



cgc gag ggg 
Arg Glu Gly 
140 

ctg ggg gat 
Leu Gly Asp 
155 



gga ttt teg 
Gly Phe Ser 
95 

cat gee gga 
His Ala Gly 
110 

cgc tgg tac 
Arg Trp Tyr 
125 

ctg ctg . ctg 
Leu Leu Leu 

cgc tgg atg 
Arg Trp Met 



tgg 2SS 
Trp 

acc 336 
Thr 

gee 384 
Ala 

ccc 432 
Pro 

tac 480 

Tyr 

160 



gtg gtc ttc tgg ccg ttg ccg teg ate ctg gcg teg ate cag ctg ttc 528 
Val Val Phe Trp Pro Leu Pro Ser He Leu Ala Ser He Gin Leu Phe 
165 170 175. 

gtg ttc ggc act tgg ctg ccg cac cgc ccc ggc cac gac gcg ttc ccg 576 
Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 
180 185 190 



gac cgc cat aat gcg egg teg teg egg ate age gac cct gtg teg ctg 624 

Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Val Ser Leu 
195 200 205 

ctg acc tgc ttt cat ttt ggc ggt tat cat cac gaa cac cac ctg cac 672 

Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 



ccg acg gtg ccg tgg tgg cgc ctg ccc age acc cgc acc aag ggg gac 
Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 240 



720 
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acc g c a t g a 
Thr Ala 



<210> 20 

<211> 242 

<212> PRT 

<213> Paracoccus marcusii 

<400> 20 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
1 5 10 15 



lie Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 



Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro He Leu Ala Val Ala 
35 40 45 



Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe He He Ala 
50 55 60 



His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn 
65 70 75 80 



Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp 
85 90 95 



Arg Lys Met He Val Lys His Met Ala His His Arg His Ala Gly Thr 
100 105 110 



Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 
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115 120 125 



Arg Phe lie Gly Thr .Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 
130 135 140 



Val lie Val Thr Val Tyr Ala Leu lie Leu Gly Asp Arg Trp Met Tyr 
145 150 155 160 



Val Val Phe Trp Pro Leu Pro Ser lie Leu Ala Ser lie Gin Leu Phe 
165 170 175 



Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala . Phe Pro 
180 185 190 



Asp Arg His Asn Ala Arg Ser Ser Arg lie Ser Asp Pro Val Ser Leu 
195 200 205 



Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 



Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 240 



Thr Ala 



<210> 21 

<211> 1629 

<212> DNA 

<213> Synechocystis sp. 
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<220> 

<22I> CDS 

<222> (1) . . (1629) 

<400> 21 

ate ate acc acc gat gtt gtc att att ggg gcg ggg cac aat ggc tta 4 S 

Met lie Thr Thr Asp Val Val lie lie Gly Ala Gly His Asn Gly Leu 
1 5 10 15 

gtc tgt gca gec tat ttg etc caa egg ggc ttg ggg gtg acg tta eta- 96 
Val Cys Ala Ala Tyr Leu Leu Gin Arg Gly Leu Gly Val Thr Leu Leu 
20 25 30 

gaa aag egg gaa gta cca ggg ggg gcg gee acc aca gaa get etc atg 144 
Glu Lys Arg Glu Val Pro Gly Gly Ala Ala Thr Thr Glu Ala Leu Met.. 
35 40 45 

ccg gag eta tec ccc cag ttt cgc ttt aac cgc tgt gee att gac cac • 192 
Pro Glu Leu Ser Pro Gin Phe Arg Phe Asn Arg Cys Ala lie Asp His 
50 55 60 

gaa ttt ate ttt ctg ggg ccg gtg ttg cag gag eta aat tta gee cag 240 
Glu Phe lie Phe Leu Gly Pro Val Leu Gin Glu Leu Asn Leu Ala Gin 
65 70 75 80 



tat ggt ttg gaa tat tta ttt tgt gac ccc agt gtt ttt tgt ccg ggg 288 

Tyr Gly Leu Glu Tyr Leu Phe Cys Asp Pro Ser Val Phe Cys Pro Gly 

85 90 95 

ctg gat ggc caa get ttt atg age tac cgt tec eta gaa aaa acc tgt 336 

Leu Asp Gly Gin Ala Phe Met Ser Tyr Arg Ser Leu Glu Lys Thr Cys 

100 105 110 

gee cac att gee acc tat age ccc cga gat gcg gaa aaa tat egg caa 384 

Ala His lie Ala Thr Tyr Ser Pro Arg Asp Ala Glu Lys Tyr Arg Gin 

115 120 125 

ttt gtc aat tat tgg acg gat ttg etc aac get gtc cag cct get ttt 432 

Phe Val Asn Tyr Trp Thr Asp Leu Leu Asn Ala Val Gin Pro Ala Phe 

130 135 140 
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aat get ccg ccc cag get tta eta gat tta gee ctg aac tat cat: tgg 

Asn Ala Pro Pro Gin Ala Leu Leu Asp Leu Ala Leu Asn Tyr Gly Trp 

145 150 155 160 

gaa aac tta aaa tec gtg ctg gcg ate gec ggg teg aaa acc aag gcg 

Glu Asn Leu Lys Ser Val Leu Ala lie Ala Gly Ser Lys Thr Lys Ala 

165 170 175 

ttg gat ttt ate cgc act atg ate ggc tec ccg gaa gat gtg etc aat 

Leu Asp Phe lie Arg Thr Met lie Gly Ser Pro Glu Asp Val Leu Asn 

180 185 190 

gaa tgg ttc gac age gaa egg gtt aaa get cct tta get aga eta tgt 

Glu Trp Phe Asp Ser Glu Arg Val Lys Ala Pro Leu Ala Arg Leu Cys 

195 200 205 



teg gaa att ggc get ccc cca tec caa aag ggt agt age tec ggc atg 

Ser Glu lie Gly Ala Pro Pro Ser Gin Lys Gly Ser Ser Ser Gly Met 

210 215 220 

atg atg gtg gee atg egg cat ttg gag gga att gee aga cca aaa gga 

Met Met Val Ala Met Arg His Leu Glu Gly lie Ala Arg Pro Lys Gly 

225 230 235 240 

ggc act gga gee etc aca gaa gee ttg gtg aag tta gtg caa gee caa 

Gly Thr Gly Ala Leu Thr Glu Ala Leu Val Lys Leu Val Gin Ala Gin 

245 250 255 

ggg gga aaa ate etc act gac caa acc gtc aaa egg gta ttg gtg gaa 

Gly Gly Lys lie Leu Thr Asp Gin Thr Val Lys Arg Val Leu Val Glu 

260 265 270 

aac aac cag gcg ate ggg gtg gag gta get aac gga gaa cag tac egg 

Asn Asn Gin Ala lie Gly Val Glu Val Ala Asn Gly Glu Gin Tyr Arg 

275 280 285 

gee aaa aaa ggc gtg att tct aac ate gat gee cgc cgt tta ttt ttg 

Ala Lys Lys Gly Val lie Ser Asn lie Asp Ala Arg Arg Leu Phe Leu 

290 295 300 



caa ttg gtg gaa ccg ggg gee eta gee aag gtg aat caa aac eta ggg 
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Glu Phe lie Phe Leu Gly Pro Val Leu Gin Glu Leu As n Leu Ala Gin 
65 70 75 80 



Tyr Gly Leu Glu Tyr Leu Phe Cys Asp Pro Ser Val Phe- Cys Pro Gly 
85 90 95 



Leu Asp Gly Gin Ala Phe Met Ser Tyr Arg Ser Leu Glu Lys Thr Cys 
100 105 110 



Ala His lie Ala Thr Tyr Ser Pro Arg Asp Ala Glu Lys Tyr Arg Gin. 
115 120 125 



Phe Val Asn Tyr Trp Thr Asp Leu Leu Asn Ala Val Gin Pro Ala Phe 
130 135 140 



Asn Ala Pro Pro Gin Ala Leu Leu Asp Leu Ala Leu Asn Tyr Gly Trp 
145 150 155 160 



Glu Asn Leu Lys Ser Val Leu Ala lie Ala Gly Ser Lys Thr Lys Ala 
165 170 175 



Leu Asp Phe lie Arg Thr Met lie Gly Ser Pro Glu Asp Val Leu Asn 
180 185 190 



Glu Trp Phe Asp Ser Glu Arg Val Lys Ala Pro Leu Ala Arg Leu Cys 
195 200 205 



Ser Glu lie Gly Ala Pro Pro Ser Gin Lys Gly Ser Ser Ser Gly Met 
210 215 220 
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Met Met Val Ala 
225 



Gly Thr Gly Ala 



Gly Gly Lys lie 
260 



Asn Asn Gin Ala 
275 



Ala Lys Lys Gly 
290 



Gin Leu Val Glu 
305 



Glu Arg Leu Glu 



lie Asp Cys Ala 
340 



Pro Glu Asp Leu 
355 



Val Glu Glu Ala 
370 



Met Arg His Leu 
230 



Leu Thr Glu Ala 
245 



Leu Thr Asp Gin 



He Gly Val Glu 
280 



Val He Ser Asn 
295 



Pro Gly Ala Leu 
310 



Arg Arg Thr Val 
325 



Leu Ser Gly Leu 



Thr Gly Thr He 
360 



His Ala Leu He 
375 



Glu Gly lie Ala 
235 



Leu Val Lys Leu 
250 



Thr Val Lys Arg 
265 



Val Ala Asn Gly 



He Asp Ala Arg 
300 



Ala Lys Val Asn 
315 



Asn Asn Asn Glu 
330 



Pro His Phe Thr 
345 



Leu He Ala Asp 



Ala Leu Gly Gin 
380 



Arg Pro Lys Gly 
240 



Val Gin Ala Gin 
255 



Val Leu Val Glu 
270 



Glu Gin Tyr Arg 
285 



Arg Leu Phe Leu 



Gin Asn Leu Gly 
320 



Ala He Leu Lys 
335 



Ala Met Ala Gly 
350 



Ser Val Arg His 
365 



lie Pro Asp Ala 



Asn Pro Ser Leu Tyr Leu Asp lie Pro Thr Val Leu Asp Pro Thr Met 
385 390 395 400 
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Ala Pro Pro Gly Gin His Thr Leu Trp lie Giu Phe Phe Ala Pro Tyr 
405 410 415 



Arg lie Ala Gly Leu Glu Gly Thr Gly Leu Met Gly Thr Gly Trp Thr 
420 425 430 



Asp Glu Leu Lys Glu Lys Val Ala Asp Arg Val lie Asp Lys Leu Thr 
435 440 445 



Asp Tyr Ala Pro Asn Leu Lys Ser Leu lie lie Gly Arg Arg Val Glu 
450 455 460 



Ser Pro Ala Glu Leu Ala Gin Arg Leu Gly Ser Tyr Asn Gly Asn Val 
465 470 475 480 



Tyr His Leu Asp Met Ser Leu Asp Gin Met Met Phe Leu Arg Pro Leu 
485 490 495 



Pro Glu lie Ala Asn Tyr Gin Thr Pro He Lys Asn Leu Tyr Leu Thr 
500 505 510 



Gly Ala Gly Thr His Pro Gly Gly Ser He Ser Gly Met Pro Gly Arg 
515 520 525 



Asn Cys Ala Arg Val Phe Leu Lys Gin Gin Arg Arg Phe Trp 
530 535 540 



<210> 23 

<21i> 776 

<212> DNA 

<213> Bradyrhizobium sp. 
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<220> 

<221> CDS 

<222> (1) . . (774 ) 



<400> 23 

atg cat gca gca 

Met His Ala Ala 

1 

gac gat gcg agg 
Asp Asp Ala Arg 
20 

ate gec gec tgg 
lie Ala Ala Trp 
35 



acc gec aag get 
Thr Ala Lys Ala 
5 

cag cgc cgc gtc 
Gin Arg Arg Val 

ctg gtg ctg cat 
Leu Val Leu His 
40 



act gag ttc ggg 
Thr Glu Phe Gly 
10 

ggt etc acg ctg 
Gly Leu Thr Leu 
25 

gtc ggt ctg atg 
Val Gly Leu Met 



gee tct egg cgc 
Ala Ser Arg Arg 
15 . 

gee gcg gtc ate 
Ala Ala Val lie 
30 

ttc ttc tgg ccg 
Phe Phe Trp Pro 
45 



ctg acc ctt cac age ctg ctg ccg get ttg . cct ctg gtg gtg ctg cag 
Leu Thr Leu His Ser Leu Leu Pro Ala Leu Pro Leu Val Val Leu Gin 
50 55 60 



acc tgg etc tat gta ggc ctg ttc ate ate gcg cat gac tgc atg cac 
Thr Trp Leu Tyr Val Gly Leu Phe lie lie Ala His Asp Cys Met His 
65 70 75 80 



ggc teg ctg gtg ccg ttc aag ccg cag gtc aac cgc cgt ate gga cag 
Gly Ser Leu Val Pro Phe Lys Pro Gin Val Asn Arg Arg lie Gly Gin 
85 90 95 



etc tgc ctg ttc etc tat gee ggg ttc tec ttc gac get etc aat gtc 
Leu Cys Leu Phe Leu Tyr Ala Gly Phe Ser Phe Asp Ala Leu Asn Val 
100 105 110 



gag cac cac aag cat cac cgc cat ccc ggc acg gee gag gat ccc gat 
Glu His His Lys His His Arg His Pro Gly Thr Ala Glu Asp Pro Asp 
115 120 125 



ttc gac gag gtg ccg ccg cac ggc ttc tgg cac tgg ttc gee age ttt 
Phe Asp Glu Val Pro Pro His Gly Phe Trp His Trp Phe Ala Ser Phe 
130 135 140 
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480 



528 



576 



ttc ctg cac tat ttc ggc tgg aag cag gtc gcg ate ate gca gec etc 

Phe Leu His Tyr Phe Gly Trp Lys Gin Vai Ala lie lie Ala Ala Val 
145 150 155 160 

teg ctg gtt tat cag etc gtc ttc gee gtt ccc ttg cag aac ate ctg 

Ser Leu Val Tyr Gin Leu Val Phe Ala Val Pro Leu Gin Asn lie Leu 
165 170 175 

ctg ttc tgg gcg ctg ccc ggg ctg ctg teg gcg ctg cag ctg ttc ace 

Leu Phe Trp Ala Leu Pro Gly Leu Leu Ser Ala Leu Gin Leu Phe Thr 
180 185 190 

ttc ggc ace tat ctg ccg cac aag ccg gee acg cag ccc ttc gee gat 624 

Phe Gly Thr Tyr Leu Pro His Lys Pro Ala Thr Gin Pro Phe Ala Asp • 

195 200 205 

cgc cac aac gcg egg acg age gaa ttt ccc gcg tgg ctg teg ctg ctg 672 

Arg His Asn Ala Arg Thr Ser Glu Phe Pro Ala Trp Leu Ser Leu Leu 
210 215 220 

ace tgc ttc* cac ttc ggc ttt cat cac gag cat cat ctg cat ccc gat 
Thr Cys Phe His Phe Gly Phe His His Glu His His Leu His Pro Asp 
225 230 235 240 

gcg ccg tgg tgg egg ctg ccg gag ate aag egg egg gee ctg gaa agg 
Ala Pro Trp Trp Arg Leu Pro Glu lie Lys Arg Arg Ala Leu Glu Arg 
245 250 255 

cgt gac ta 776 
Arg Asp 



720 



768 



<210> 24 

<211> 258 

<212> PRT 

<213> Bradyrhizobium sp. 

<400> 24 



Met His Ala Ala Thr Ala Lys Ala Thr Glu Phe Gly Ala Ser Arg Arg 
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10 



15 



Asp Asp Ala Arg Gin Arg Arg Val Gly Leu Thr Leu Ala Ala Val lie 
20 25 30 



lie Ala Ala Trp Leu Val Leu His Val Gly Leu Met Phe Phe Trp Pro 
35 40 45 



Leu Thr Leu His Ser Leu Leu Pro Ala Leu Pro Leu Val Val Leu Gin 
50 55 60 



Thr Trp Leu Tyr Val Gly Leu Phe lie lie Ala His Asp Cys Met His 
65 70 75 80 



Gly Ser Leu Val Pro Phe Lys Pro Gin Val Asn Arg Arg lie Gly Gin 
85 90 95 



Leu Cys Leu Phe Leu Tyr Ala Gly Phe Ser Phe Asp Ala Leu Asn Val 
100 105 110 



Glu His His Lys His His Arg His Pro Gly Thr Ala Glu Asp Pro Asp 
115 120 125 



Phe Asp Glu Val Pro Pro His Gly Phe Trp His Trp Phe Ala Ser Phe 
130 135 140 



Phe Leu His Tyr Phe Gly Trp Lys Gin Val Ala lie lie Ala Ala Val 
145 150 155 160 



Ser Leu Val Tyr Gin Leu Val Phe Ala Val Pro Leu Gin Asn lie Leu 
165 170 175 
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Leu Phe Trp Ala Leu Pro Gly Leu Leu Ser Ala Leu Gin Leu Phe Thr 
180 185 190 



Phe Gly Thr Tyr Leu Pro His Lys Pro Ala Thr Gin Pro Phe Ala Asp 
195 200 205 

Arg His Asn Ala Arg Thr Ser Glu Phe Pro Ala Trp Leu Ser Leu Leu' 
210 215 220 

Thr Cys Phe His Phe Gly Phe His His Glu His His Leu His Pro Asp 
225 230 235 240- 

Ala Pro Trp Trp Arg Leu Pro Glu lie Lys Arg Arg Ala Leu Glu Arg 
245 250 255 



Arg Asp 



<210> 25 

<211> 777 

<212> DNA 

<213> Nostoc sp. 



<220> 

<221> CDS 

<222> (1) . . (777) 

<400> 25 

atg gtt cag tgt caa cca tea tct ctg cat tea gaa aaa ctg gtg tta 

Met Val Gin Cys Gin Pro Ser Ser Leu His Ser Glu Lys Leu Val Leu 
15 10 15 

ttg tea teg aca ate aga gat gat aaa aat att aat aag ggt ata ttt 
Leu Ser Ser Thr He Arg Asp Asp Lys Asn He Asn Lys Gly He Phe 
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20 25 30 

att gcc tgc ttt ate tta ttt tta tgg gca att agt tta arc tta tta 144 

lie Ala Cys Phe lie Leu Phe Leu Trp Ala lie Ser Leu lie Leu Leu 

35 40 45 

etc tea ata gat aca tec ata att cat aag age tta tta ggt ata gcc 192 

Leu Ser He Asp Thr Ser He He His Lys Ser Leu Leu Gly He Ala 
50 55 60 



atg ctt tgg cag ace ttc tta tat aca ggt tta ttt att act get cat 
Met Leu Trp Gin Thr Phe Leu Tyr Thr Gly Leu Phe He Thr Ala His 
65 . 70 75 80 

gat gcc atg cac ggc gta gtt tat ccc aaa aat ccc aga ata aat aat 
.Asp Ala Met His Gly Val Val Tyr Pro Lys Asn Pro Arg He Asn Asn 
85 90 95 

ttt ata ggt aag etc act eta ate ttg tat gga eta etc cct tat aaa 
Phe He Gly Lys Leu Thr Leu He Leu Tyr Gly Leu Leu Pro Tyr Lys 
100 105 . HO 

gat tta ttg aaa aaa cat tgg tta cac cac gga cat cct ggt act gat 
Asp Leu Leu Lys Lys His Trp Leu His His Gly His Pro Gly Thr Asp 
115 120 125 

tta gac cct gat tat tac aat ggt cat ccc caa aac ttc ttt ctt tgg 
Leu Asp Pro Asp Tyr Tyr Asn Gly His Pro Gin Asn Phe Phe Leu Trp 
130 135 140 

tat eta cat ttt atg aag tct tat tgg cga tgg acg caa att ttc gga 
Tyr Leu His Phe Met Lys Ser Tyr Trp Arg Trp Thr Gin lie Phe Gly 
145 150 155 160 

tta gtg atg att ttt cat gga ctt aaa aat ctg gtg cat ata cca gaa 
Leu Val Met lie Phe His Gly Leu Lys Asn Leu Val His He Pro Glu 
165 170 175 

aat aat tta att ata ttt tgg atg ata cct tct att tta agt tea gta 
Asn Asn Leu lie lie Phe Trp Met lie Pro Ser lie Leu Ser Ser Val 
180 185 190 



240 



238 



336 



384 



432 



480 



528 



576 
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caa eta ttt tat ttt ggt aca ttt ttg cct cat aaa aag eta g.aa ggt 624 
Gin Leu Phe Tyr Phe Gly Thr ?he Leu Pro His Lys Lys Leu Glu Gly 
135 200 205 

ggt tat act aac ccc cat tgt gcg cgc agt ate cca tta cct ctt ttt. 672 
Gly Tyr Thr Asn Pro His Cys Ala Arg Ser lie Pro Leu Pro Leu Phe 
210 215 220 



tgg tct ttt gtt act tgt tat cac ttc ggc tac cac aag gaa cat cac 
Trp Ser Phe Val Thr Cys Tyr His Phe Gly Tyr His Lys Glu His His' 
225 230 235 240 

gaa tac cct caa ctt cct tgg tgg aaa tta cct gaa get cac aaa ata 
Glu Tyr Pro Gin Leu Pro Trp Trp Lys Leu Pro Glu Ala His Lys lie 
245. 250 255 

tct tta taa 
Ser Leu 



720 



768 



777 



<210> 26 

<211> 258 

<212> PRT 

<213> Nostoc sp. 

<400> 26 

Met Val Gin Cys Gin Pro Ser Ser Leu His Ser Glu Lys Leu Val Leu 



Leu Ser Ser Thr lie Arg Asp Asp Lys Asn He Asn Lys Gly He Phe 
20 25 30 



He Ala Cys Phe He Leu Phe Leu Trp Ala He Ser Leu He Leu Leu 
35 40 45 



Leu Ser He Asp Thr Ser He He His Lys Ser Leu Leu Gly He Ala 
50 55 60 
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Met Leu Trp Gin Thr Phe Leu Tyr Thr Gly Leu Phe lie Thr Ala His 
65 70 "75 SO 



Asp Ala Met Kis Gly Val Val Tyr Pro Lys Asn Pro Arg lie Asn Asn 
85 90 95 



Phe lie Gly Lys Leu Thr Leu lie Leu Tyr Gly Leu Leu Pro Tyr Lys 
100 105 110 



Asp Leu Leu Lys Lys His Trp Leu His His Gly His Pro Gly Thr Asp 
115 120 125 



Leu Asp Pro Asp Tyr Tyr Asn Gly His Pro Gin Asn Phe Phe Leu Trp 
130 135 140 



Tyr Leu His Phe Met Lys Ser Tyr Trp Arg Trp Thr Gin lie Phe Gly 
145 150 155 160 



Leu Val Met lie Phe His Gly Leu Lys Asn Leu Val His lie Pro Glu 
165 170 175 



Asn Asn Leu He He Phe Trp Met He Pro Ser He Leu Ser Ser Val 
180 185 190 



Gin Leu Phe Tyr Phe Gly Thr Phe Leu Pro His Lys Lys Leu Glu Gly 
195 200 205 



Gly Tyr Thr Asn Pro His Cys Ala Arg Ser lie Pro Leu Pro Leu Phe 
210 215 220 
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Trp Ser Phe Val Thr Cys Tyr His Phe Gly Tyr His Lys Giu His His 
225 230 235 240 



Glu Tyr Pro Gin Leu Pro Trp Trp Lys Leu Pro Giu Ala His Lys lie 
245 250 255 



Ser Leu 



<210> 27 

<211> 789 

<212> DNA 

<213> Nostoc punctiforme 



<220> 

<221> CDS 

<222> (1) .-. (789) 



<400> 27 

ttg aat ttt tgt gat aaa cca gtt age tat tat gtt gca ata gag caa 48 

Leu Asn Phe Cys Asp Lys Pro Val Ser Tyr Tyr Val Ala lie Glu Gin 
15 10 15 

tta agt get aaa gaa gat act gtt tgg ggg ctg gtg att gtc ata gta 96 

Leu Ser Ala Lys Glu Asp Thr Val Trp Gly Leu Val lie Val lie Val 
20 25 30 

att att agt ctt tgg gta get agt ttg get ttt tta eta get att aat 144 

lie lie Ser Leu Trp Val Ala Ser Leu Ala Phe Leu Leu Ala lie Asn 

35 40 45 

tat gee aaa gtc cca att tgg ttg ata cct att gca ata gtt tgg caa 192 
Tyr Ala Lys Val Pro lie Trp Leu lie Pro lie Ala lie Val Trp Gin 
50 55 60 



atg ttc ctt tat aca ggg eta ttt att act gca cat gat get atg cat 240 
Met Phe Leu Tyr Thr Gly Leu Phe lie Thr Ala His Asp Ala Met His 
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65 70 75 80 

ggg tea gtt tat cat aaa aat ccc aaa att aat aat ttt ate ggr tea 288 

Gly Ser Val Tyr Arg Lys Asn Pro Lys lie Asn Asn Phe lie Gly Ser 
85 90 95 



eta get gta gcg ctt tac get gtg ttt cca tat caa cag atg tta aag 
Leu Ala Val Ala Leu Tyr Ala Val Phe Pro Tyr Gin Gin Met Leu Lys 
100 105 110 

aat cat tgc -tta cat cat cgt cat cct get age gaa gtt gac cca gat 
Asn His Cys Leu His His Arg His Pro Ala Ser Glu Val Asp Pro Asp 
115 120 125 

ttt cat gat ggt aag aga aca aac get att ttc tgg tat etc cat ttc- 
Phe His Asp Gly Lys Arg Thr Asn Ala lie Phe Trp Tyr Leu His Phe 
130 135 140 

atg ata gaa tac tec agt tgg caa cag tta ata gta eta act ate eta 
Met lie Glu Tyr Ser Ser Trp Gin Gin Leu lie Val Leu Thr lie Leu 
145 150 155 160 - 

ttt aat tta get aaa tac gtt ttg cac ate cat caa ata aat etc ate 
Phe Asn Leu Ala Lys Tyr Val Leu His lie His Gin lie Asn Leu lie 
165 170 175 

tta ttt tgg agt att cct cca att tta agt tec att caa ctg ttt tat 
Leu Phe Trp Ser lie Pro Pro lie Leu Ser Ser lie Gin Leu Phe Tyr 
180 185 190 

ttc gga aca ttt ttg cct cat cga gaa ccc aag aaa gga tat gtt tat 
Phe Gly Thr Phe Leu Pro His Arg Glu Pro Lys Lys Gly Tyr Val Tyr 
195 200 205 

ccc cat tgc age caa aca ata aaa ttg cca act ttt ttg tea ttt ate 
Pro His Cys Ser Gin Thr lie Lys Leu Pro Thr Phe Leu Ser Phe lie 
210 215 220 



336 



364 



432 



480 



528 



576 



624 



672 



get tgc tac cac ttt ggt tat cat gaa gaa cat cat gag tat ccc cat 
Ala Cys Tyr His Phe Gly Tyr His Glu Glu His His Glu Tyr Pro His 
225 230 235 240 



720 
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gta cct ngg tgg caa ctt cca tct gta tat aag cag aga gta ttc aac 76S 
Vai Pro Trp Trp Gin Leu Pro Ser Val Tyr Lys Gin Arg Val Phe Asn 
245 250 255 



aat tea gta acc aat teg taa 
Asn Ser Val Thr Asn Ser 
260 



<210> 28 

<211> 262 

<212> PRT 

<213> Nostoc punctiforme 



<400> 28 

Leu Asn Phe Cys Asp Lys Pro Val Ser Tyr Tyr Val Ala lie Glu Gin 
15 10 15 



Leu Ser Ala Lys Glu Asp Thr Val Trp Gly Leu Val lie Val lie Val 
20 25 30 



lie He Ser Leu Trp Val Ala Ser Leu Ala Phe Leu Leu Ala He Asn 
35 40 45 



Tyr Ala Lys Val Pro He Trp Leu lie Pro He Ala He Val Trp Gin 
50 55 60 



Met Phe Leu Tyr Thr Gly Leu Phe lie Thr Ala His Asp Ala Met His 
65 70 75 80 



Gly Ser Val Tyr Arg Lys Asn Pro Lys lie Asn Asn Phe He Gly Ser 
85 90 95 



789 



Leu Ala Val Ala Leu Tyr Ala Val Phe Pro Tyr Gin Gin Met Leu Lys 
100 105 110 
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Asn His Cys Leu 
115 



Phe His Asp Gly 
130 



Met lie Giu Tyr 
145 



Phe Asn Leu Ala 



Leu Phe Trp Ser 
180 



Phe Gly Thr Phe 
195 



Pro His Cys Ser 
210 



Ala Cys Tyr His 
225 



Val Pro Trp Trp 



His His Arg His 
120 



Lys Arg Thr Asn 
135 



Ser Ser Trp Gin 
150 



Lys Tyr Val Leu 
165 



lie Pro Pro lie 



Leu Pro His Arg 
200 



Gin Thr He Lys 
215 



Phe Gly Tyr His 
230 



Gin Leu Pro Ser 
245 



Pro Ala Ser Glu 



Ala He Phe Trp 
140 



Gin Leu He Val 
155 



His lie His Gin 
170 



Leu Ser Ser He 
185 



Glu Pro Lys Lys 



Leu Pro Thr Phe 
220 



Glu Glu His His 
235 



Val Tyr Lys Gin 
250 



Val Asp Pro Asp 
125 



Tyr Leu His Phe 



Leu Thr He Leu 
160 



He Asn Leu lie- 
175 



Gin Leu Phe Tyr 
190 



Gly Tyr Val Tyr 
205 



Leu Ser Phe He 



Glu Tyr Pro His 
240 



Arg Val Phe Asn 
255 



Asn Ser Val Thr Asn Ser 
2 60 
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<210> 29 

<2il> 762 

<212> DNA 

<213> Nostoc punctiforme 



<220> 

<221> CDS 

<222> (1) . . (762) 

<400> 29 

gtg ate cag tta gaa caa cca etc agt cat caa gca aaa ctg act cca 

Val lie Gin Leu Glu Gin Pro Leu Ser His Gin Ala Lys Leu Thr Pro 

15 10 15 



gta ctg aga agt aaa tct cag ttt aag ggg ctt ttc att get att gtc 

Val Leu Arg Ser Lys Ser Gin Phe Lys Gly Leu Phe lie Ala lie Val 
20 25 30 

att gtt age gca tgg gtc att age ctg agt tta tta ctt tec ctt gac 

lie Val Ser Ala Trp Val lie Ser Leu Ser Leu Leu Leu Ser Leu Asp 
35 40 45 

ate tea aag eta aaa ttt tgg atg tta ttg cct gtt ata eta tgg caa 

lie Ser Lys Leu Lys Phe Trp Met Leu Leu Pro Val lie Leu Trp Gin 
50 55 60 



aca ttt tta tat acg gga tta ttt att aca tct cat gat gee atg cat 
Thr Phe Leu Tyr Thr Gly Leu Phe lie Thr Ser His Asp Ala Met His 
65 70 75 80 



ggc gta gta ttt ccc caa aac acc aag att aat cat ttg att gga aca 
Gly Val Val Phe Pro Gin Asn Thr Lys lie Asn His Leu lie Gly Thr 
85 90 95 



ttg acc eta tec ctt tat ggt ctt tta cca tat caa aaa eta ttg aaa 
Leu Thr Leu Ser Leu Tyr Gly Leu Leu Pro Tyr Gin Lys Leu Leu Lys 
100 105 110 



aaa cat tgg tta cac cac cac aat cca gca age tea ata gac ccg gat 
Lys His Trp Leu His His His Asn Pro Ala Ser Ser lie Asp Pro Asp 
115 120 125 
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ttt 


cac 


cat 


ggt 




cac 


caa 


agt 


ttc 


ttt 


get 


tgg 


tar 


ttt 


cat 


ttt 


4 32 


Phe 


His 


Asn 


Gly 


Lys 


His 


Gin 


Ser 


Phe 


Phe 


Ala 


Trp 


Tyr 


Phe 


His 


Phe 






130 










135 










140 












atg 


aaa 


ggt 


tac 


tgg 


agt 


tgg 


ggg 


caa 


ata 


att 


gcg 


ttg 


act 


a "C t 


att 


480 


Met 


Lys 


Gly 


Tyr 


Trp 


Ser 


Trp 


Gly 


Gin 


He 


He 


Ala 


Leu 


Thr 


lie 


He 




145 










150 










155 










160 




tat 


aac 


ttt 


get 


aaa 


tac 


ata 


etc 


cat 


ate 


cca 


agt 


gat 


aat 


eta 


act' 


528 


Tyr 


Asn 


Phe 


Ala 


Lys 


Tyr 


He 


Leu 


His 


He 


Pro 


Ser 


Asp 


Asn 


Leu 


Thr 












165 










170 










175 






tac 


ttt 


tgg 


gtg 


eta 


ccc 


teg 


ctt 


tta 


agt 


tea 


tta 


caa 


tta 


ttc 


tat 


576 


Tyr 


Phe 


Trp 


Val 


Leu 


Pro 


Ser 


Leu 


. Leu 


Ser 


Ser 


Leu 


Gin 


Leu 


Phe 


Tyr. 










180 










185 










190 








ttt 


ggt 


act 


ttt 


tta 


ccc 


cat 


agt 


gaa 


cca 


ata 


ggg 


ggt 


tat 


gtt 


cag- 


624 


Phe 


Gly 


Thr 


Phe 


Leu 


Pro 


His 


Ser 


Glu 


Pro 


He 


Gly 


Gly 


Tyr 


Val 


Gin 








195 










200 










205 










cct 


cat 


tgt 


gec 


caa 


aca 


att 


age 


cgt 


cct 


att 


tgg 


tgg 


tea 


ttt 


ate 


672 


Pro 


His 


Cys 


Ala 


Gin 


Thr 


He 


Ser 


Arg 


Pro 


He 


Trp 


Trp 


Ser 


Phe 


He 






210 










215 










220 












acg 


tgc 


tat 


cat 


ttt 


ggc 


tac 


cac 


gag 


gaa 


cat 


cac 


gaa 


tat 


cct 


cat 


720 


Thr 


Cys 


Tyr 


His 


Phe 


Gly 


Tyr 


His 


Glu 


Glu 


His 


His 


Glu 


Tyr 


Pro 


His 




225 










230 










235 










240 




att 


tct 


tgg 


tgg 


cag 


tta 


cca 


gaa 


att 


tac 


aaa 


gca 


aaa 


tag 






762 


He 


Ser 


Trp 


Trp 


Gin 


Leu 


Pro 


Glu 


He 


Tyr 


Lys 


Ala 


Lys 











245 250 



<210> 30 

<211> 253 

<212> PRT 

<213> Nostoc punctiforme 



<400> 30 
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Val lie Gin Leu 
1 

Val Leu Arg Ser 
20 



lie Val Ser Ala 
35 



lie Ser Lys Leu 
50 



Thr Phe Leu Tyr 
65 



Gly Val VaT Phe 



Leu Thr Leu Ser 
100 



Lys His Trp Leu 
115 



Phe His Asn Gly 
130 



Met Lys Gly Tyr 
145 



Glu Gin Pro Leu 
5 

Lys Ser Gin Phe 



Trp Val He Ser 
40 



Lys Phe Trp Met 
55 



Thr Gly Leu Phe 
70 



Pro Gin Asn Thr 
85 



Leu Tyr Gly Leu 



His His His Asn 
120 



Lys His Gin Ser 
135 



Trp Ser Trp Gly 
150 



Ser His Gin Ala 
10 



Lys Gly Leu Phe 
25 



Leu Ser Leu Leu 



Leu Leu Pro Val 
60 



He Thr Ser His 
75 



Lys He Asn His 
90 



Leu Pro Tyr Gin 
105 



Pro Ala Ser Ser 



Phe Phe Ala Trp 
140 



Gin He He Ala 
155 



Lys Leu Thr Pro 
15 



He Ala He Val 
30 



Leu Ser Leu -Asp 
45 



He Leu Trp Gin 



Asp Ala Met His 
80 



Leu He Gly Thr 
95 



Lys Leu Leu Lys 
110 



He Asp Pro Asp 
125 



Tyr Phe His Phe 



Leu Thr He He 
160 



Tyr Asn Phe Ala Lys Tyr He Leu His lie Pro Ser Asp Asn Leu Thr 
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165 170 175 



Tyr Phe Trp Val Leu Pro Ser Leu Leu Ser Ser Leu Gin Leu Phe Tyr 
180 185 190 



Phe Gly Thr Phe Leu Pro His Ser 
195 200 



Pro His Cys Ala Gin Thr lie Ser 
210 215 



Thr Cys Tyr His Phe Gly Tyr His 
225 230 



lie Ser Trp Trp Gin Leu Pro Glu 
245 



Glu Pro He Gly Gly Tyr Val Gin 
205 



Arg Pro He Trp Trp Ser Phe He 
220 



Glu Glu His His Glu Tyr Pro His 
235 240 



He Tyr Lys Ala Lys 
250 



<210> 31 

<211> 1608 

<212> DNA 

<213> Haematococcus pluvialis 



<220> 

<221> CDS 

<222> (3).. (971) 

<400> 31 

ct aca ttt cac aag ccc gtg age ggt gca age get ctg ccc cac ate 47 

Thr Phe His Lys Pro Val Ser Gly Ala Ser Ala Leu Pro His He 
15 10 15 



ggc cca cct cct cat etc cat egg tea ttt get get ace acg atg ctg 
Gly Pro Pro Pro His Leu His Arg Ser Phe Ala Ala Thr Thr Met Leu 
20 25 30 



95 
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gca cac aaa gcc ate tgg cat gag teg cct ctg ggc tgg ctg ctg cac 
Ala His Lys Ala lie Trp His Glu Ser Pro Leu Gly Trp Leu Leu His 
180 185 190 



191 



239 



287 



335 



teg aag ctg cag tea ate age gtc aag gcc cgc cgc gtt gaa eta gcc 143 
Ser Lys Leu Gin Ser lie Ser Val Lys Ala Arg Arg Vai Glu Leu Ala 
35 40 45 

cgc gac ate acg egg ccc aaa gtc tgc ctg cat get cag egg tgc teg 
Arg Asp He Thr Arg Pro Lys Val Cys Leu His Ala Gin Arg Cys Ser 
50 55 60 

tta gtt egg ctg cga gtg gca gca cca cag aca gag gag gcg ctg gga 
Leu Val Arg Leu Arg Val Ala Ala Pro Gin Thr- Glu Glu Ala Leu Gly 
65 70 75 

ace .gtg cag get gcc ggc gcg ggc gat gag cac age gcc gat gta gca 
Thr Val Gin Ala Ala Gly Ala Gly Asp Glu His Ser Ala Asp Val Ala 
80 85 90 95 

etc cag cag ctt gac egg get ate gca gag cgt cgt gcc egg cgc aaa 
Leu Gin Gin Leu Asp Arg Ala He Ala Glu Arg Arg Ala Arg Arg Lys 
100 105 110 

egg gag cag ctg tea tac cag get gcc gcc att gca gca tea att ggc 
Arg Glu Gin Leu Ser Tyr Gin Ala Ala Ala He Ala Ala Ser He Gly 
115 120 125 

gtg tea ggc att gcc ate ttc gcc ace tac ctg aga ttt gcc atg cac 
Val Ser Gly He Ala He .Phe Ala Thr Tyr Leu Arg Phe Ala Met His 
130 135 140 

atg ace gtg ggc ggc gca gtg cca tgg ggt gaa gtg get ggc act etc 
Met Thr Val Gly Gly Ala Val Pro Trp Gly Glu Val Ala Gly Thr Leu 
145 150 155 

etc ttg gtg gtt ggt ggc gcg etc ggc atg gag atg tat gcc cgc tat 527 
Leu Leu Val Val Gly Gly Ala Leu Gly Met Glu Met Tyr Ala Arg Tyr 
160 165 170 175 



383 



431 



479 



575 



aag age cac cac aca cct cgc act gga ccc ttt gaa gcc aac gac ttg 
Lys Ser His His Thr Pro Arg Thr Gly Pro Phe Glu Ala Asn Asp Leu 



623 
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815 



863 



195 200 205 

ttt gca ate ate aat gga ctg ccc gec atg etc ctg tgt ace ttt ggc 671 
Phe Ala He He Asn Gly Leu Pro Ala Met Leu Leu Cys Thr Phe Gly 
210 215 220 

ttc tgg erg ccc aac gtc ctg ggg gcg gee tgc ttt gga gcg ggg ctg 719 
Phe Trp Leu Pro Asn Val Leu Gly Ala Ala Cys Phe Gly Ala Gly Leu 
225 230 235 

ggc ate acg eta tac ggc atg gca tat atg ttt gta cac gat ggc ctg 767 
Gly He Thr Leu Tyr Gly Met Ala Tyr Met Phe Val His Asp Gly Leu 
240 245 250 255 

gtg cac agg cgc ttt ccc ace ggg ccc ate get ggc ctg ccc tac atg- 
Val His Arg Arg Phe Pro Thr Gly Pro He Ala Gly Leu Pro Tyr Met 
260 265 270 

aag cgc ctg aca gtg gee cac cag eta cac cac age ggc aag tac ggt 
Lys Arg Leu Thr Val Ala His Gin Leu His His Ser Gly Lys Tyr Gly 
275 280 . 285 

ggc gcg ccc tgg ggt atg ttc ttg ggt- cca cag gag ctg cag cac att 
Gly Ala Pro Trp Gly Met Phe Leu Gly Pro Gin Glu Leu Gin His He 
290 295 300 

cca ggt gcg gcg gag gag gtg gag cga ctg gtc ctg gaa ctg gac tgg 
Pro Gly Ala Ala Glu Glu Val Glu Arg Leu Val Leu Glu Leu Asp Trp 
305 310 315 

tec aag egg tag ggtgcggaac caggcacgct ggtttcacac ctcatgcctg 1011 

Ser Lys Arg 

320 

tgataaggtg tggctagagc gatgcgtgtg agaegggtat gtcaeggteg actggtctga 1071 

tggccaatgg catcggccat gtctggtcat caegggctgg ttgcctgggt gaaggtgatg 1131 

cacatcatca tgtgcggttg gaggggctgg cacagtgtgg gctgaactgg agcagttgtc 1191 



911 



959 



caggctggcg ttgaatcagt gagggtttgt gattggcggt tgtgaagcaa tgactccgcc 
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catattctat ttgtgggagc tgagatgatg ccatgcttgg gatgtgcarg gaccatggta 1311 

gtgcagcaaa ctatattcac ctagggctgt tggtaggatc aggtgaggcc trgcacattg 1371 

catgatgtac tcgtcatggt gtgttggtga gaggatggat gtggatggat gtgtattctc 1431 

agacgtagac cttgactgga ggcttgatcg agagagtggg ccgtattctt tgagagggga 14 91 

ggctcgtgcc agaaatggtg agtggatgac tgtgacgctg tacattgcag gcaggtgaga 1551 

tgcactgtct cgattgtaaa atacattcag atgcaaaaaa aaaaaaaaaa aaaaaaa 1608 



<210> 32 
<211> 322 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 32 

Thr Phe His Lys Pro Val Ser Gly Ala Ser Ala Leu Pro His lie Gly 
15 10 15 



Pro Pro Pro His Leu His Arg Ser Phe Ala Ala Thr Thr Met Leu Ser 
20 25 30 



Lys Leu Gin Ser lie Ser Val Lys Ala Arg Arg Val Glu Leu Ala Arg 
35 40 45 



Asp lie Thr Arg Pro Lys Val Cys Leu His Ala Gin Arg Cys Ser Leu 
50 55 60 



Val Arg Leu Arg Val Ala Ala Pro Gin Thr Glu Glu Ala Leu Gly Thr 
65 70 75 80 



Val Gin Ala Ala Gly Ala Gly Asp Glu His Ser Ala Asp Val Ala Leu 
85 90 95 
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Gin Gin Leu Asp 
100 



Glu Gin Leu Ser 
115 



Ser Gly He Ala 
130 



Thr Val Gly Gly 
145 



Leu Val Val Gly 



■His Lys Ala He 
180 



Ser His His Thr 
195 



Ala He He Asn 
210 



Trp Leu Pro Asn 
225 



lie Thr Leu Tyr 



Arg Ala He Ala 



Tyr Gin Ala Ala 
120 



He Phe Ala Thr 
135 



Ala Val Pro Trp 
150 



Gly Ala Leu Gly 
165 



Trp His Glu Ser 



Pro Arg Thr Gly 
200 



Gly Leu Pro Ala 
215 



Val Leu Gly Ala 
230 



Gly Met Ala Tyr 
245 



Glu Arg Arg Ala 
105 



Ala He Ala Ala 



Tyr Leu Arg Phe 
140 



Gly Glu Val Ala 
155 



Met Glu Met Tyr 
170 



Pro Leu Gly Trp 
185 



Pro Phe Glu Ala 



Met Leu . Leu Cys 
220 



Ala Cys Phe Gly 
235 



Met Phe Val His 
250 



Arg Arg Lys Arg 
110 



Ser He Gly Val 
125 



Ala Met His Met 



Gly Thr Leu Leu. 

160 



Ala Arg Tyr Ala 
175 



Leu Leu His Lys 
190 



Asn Asp Leu Phe 
205 



Thr Phe Gly Phe 



Ala Gly Leu Gly 
240 



Asp Gly Leu Val 
255 



BASF NAE 878/03 



77/357 



His Arg Arg Phe Pro Thr Gly Pro lie Ala Gly Leu Pro Tyr Met Lys 
260 265 270 



Arg Leu Thr Val Ala His Gin Leu His His Ser Gly Lys Tyr Gly Gly 
275 280 285 



Ala Pro Trp Gly Met Phe Leu Gly Pro Gin Glu Leu Gin His lie Pro 



290 



295 



300 



Gly Ala Ala Glu Glu Val Glu Arg Leu Val Leu Glu Leu Asp Trp Ser 
305 310 315 320 



Lys Arg 



<210> 33 

<211> 528 • 

<212> DNA 

<213> Erwinia uredovora 



<220> 

<221> CDS 

<222> (1) . . (528) 

<400> 33 

atg ttg tgg att tgg aat gcc ctg ate gtt ttc gtt acc gtg att ggc 48 

Met Leu Trp lie Trp Asn Ala Leu lie Val Phe Val Thr Val lie Gly 
15 10 15 

atg gaa gtg att get gca ctg gca cac aaa tac ate atg cac ggc tgg 96 
Met Glu Val lie Ala Ala Leu Ala His Lys Tyr lie Met His Gly Trp 



20 



25 



30 



ggt tgg gga tgg cat 
Gly Trp Gly Trp His 



ctt tea cat cat gaa ccg cgt aaa ggt gcg ttt 
Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 



35 



40 



45 
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gaa gtt aac gat ctt tat gcc gtg gtt ttt get gca tta teg ate ctg 192 
Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser He Leu 
50 55 60 

ctg att tat ctg ggc agt aca gga atg tgg ccg etc cag tgg att ggc 240 
Leu He Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gin Trp lie Gly 
65 70 75 80 

gca ggt atg acg gcg tat gga tta etc tat ttt atg gtg cac gac ggg" 288 
Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly 
85 90 95 

ctg gtg cat caa cgt tgg cca ttc cgc tat att cca cgc aag ggc tac 336 
Leu Val His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr. 
100 105 110 

etc aaa egg ttg tat atg gcg cac cgt atg cat cac gcc gtc agg ggc 384 
Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala Val Arg Gly 
115 120 125 

aaa gaa ggt tgt gtt tct ttt ggc ttc etc tat gcg ccg ccc ctg tea 432 
Lys Glu Gly Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser 
130 135 140 



aaa ctt cag gcg acg etc egg gaa aga cat ggc get aga gcg ggc get 

Lys Leu Gin Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 150 155 160 

gcc aga gat gcg cag ggc ggg gag gat gag ccc gca tec ggg aag taa 

Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 



480 



528 



<210> 34 

<211> 175 

<212> PRT 

<213> Erwinia uredovora 

<400> 34 



Met Leu Trp He Trp Asn Ala Leu He Val Phe Val Thr Val lie Gly 
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i 



Met Glu Val lie 
20 



Gly Trp Gly Trp 
35 



Glu Val Asn Asp 
50 



Leu lie Tyr Leu 
65 



Ala Gly Met Thr 



Leu Val His Gin 
100 



Leu Lys Arg Leu 
115 



Lys Glu Gly Cys 
130 



Lys Leu Gin Ala 
145 



5 



Ala Ala Leu Ala 



His Leu Ser His 
40 



Leu Tyr Ala Val 
55 



Gly Ser Thr Gly 
70 



Ala Tyr Gly Leu 
85 



Arg Trp Pro Phe 



Tyr Met Ala His 
120 



Val Ser Phe Gly 
135 



Thr Leu Arg Glu 
150 



10 



His Lys Tyr lie 
25 



His Glu Pro Arg 



Val Phe Ala Ala 
60 



Met Trp Pro Leu 
75 



Leu Tyr Phe Met 
90 



Arg Tyr lie Pro 
105 



Arg Met His His 



Phe Leu Tyr Ala 
140 



Arg His Gly Ala 
155 



15 



Met His Gly Trp 
30 



Lys Gly Ala Phe 
45 



Leu Ser lie Leu 



Gin Trp lie Gly 
80 



Val His Asp Gly 
95 



Arg Lys Gly Tyr 
110 



Ala Val Arg Gly 
125 



Pro Pro Leu Ser 



Arg Ala Gly Ala 
160 



Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 
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<210> 35 

<211> 1520 

<212> DNA 

<213> Artificial 

<220> 

<223> Promoter 



<400> 35 

ctcgagtacc gaggcggaac ggcaggaatg tttccctctc ttttagaggg caattcttta 60 

tccaatgtca tgttgatgct agatatttct gtctcttata ataaggcgaa tacccatttt 120 

tgaattgaag ttgagataaa aaaaaagggg gcccaatttg tcaacgccaa agagtcaagc 180 

tttttctttg gctttagccg aacaatctaa gacttattgt ttttgaagat atttgacctt 240 

ttctagatat tccttcaagt aaagcttttt tcgagttttt tttttttttc tttgtgaagg 300 

atttattgtt attggtatcc attttttatt ggaagacaag ataagttaat attgattttg 360 

cttaaagatt aaaaggaaat cagaaaacga caataaaaaa tgtaacggac aaactatggt 420 

gtcgattata agtctaaatc cttaaaaaat gacaacgagt tgctttcctc tgaaaacaat 480 

tcttttgtct ttgcaagaaa ggtttctttt ttgtttgctt gcattactta aacatcaaat 540 

caaatgaaag gaataaagca gatttgaggg cgaataagga ttttctggtc aacaagatgt 600 

gagtgacacc taaggaacta aatgccattc atttgtttta aaacgacatc aaagattgat 660 

gatcaacagg attgagagag agaaaaagaa ctcgtgtcat ttatttctgt tgactgaaat 720 

tttatattta gaaaaaatgt caaatctata gctttagcta tattacataa catttgaaat 780 

aataataata aaaaaagaca cattagagac acttttcaaa ctctaaataa ctgtctataa 840 

acacaaagaa aacaaagacc tctataacaa cttattagat ttttctcgta cttttgtcta 900 

aagatgatgt attcttgtta tcccacactt ctttcatttg ttcttgatgc tactaaatat 960 
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acaaaatttc ttttttgcaa gagatattat tccaaaaatt ttcaaaaaga aatttttttc 1020 

acaatagcag ttgatcgtgt aacccaaaga ggttctttgt tattttgcac ttccgctttg 1080 

cggtgatgca tattcaaagt aatatatgga ataaacaacg tgtttaagca tgaaagaaag 1140 

gaaacaaagg ccgctttgaa caaatgcata atatttcaga caaaaatgat ctaaagcaag 1200 

cagtaaatca aacaagaaac attgctgatt cgcgttagaa aacgataaaa gtctaataag 1260 

ccactaagta tacttcaatg aactttttgt atgcttatgg tccaatcaga ccaataattt 1320 

gtgaccattc ctgaggtggc tttggtgatg cggaaacaga aaaaaatttt ctcaccaatc 1380 

gatttaaaaa acaatttctg ctttgaacca aaactttttt tttctcttta atcattaact 1440 

ttatcaagta tgtacctacc ctcaaagtcc tcactcaagc acaattatgc taacattgtt 1500 

ccaccttctc tttagaaatg 1520 



<210> 36 

<211> 16245 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (10264 ) . . (10264 ) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 



<220> 
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<221> * misc_f eature 
<222> (10563) . . (10563) 
<223> n is a, c, g, or t 

<400> 36 

ccgggctggt tgccctcgcc gctgggctgg 
aaacgccgtc gaagccgtgt gcgagacacc 
aaaacttggc cctcactgac agatgagggg 
ccggcgcggc gttgacagat gaggggcagg 
cagcctcgca aatcggcgaa aacgcctgat 
caagcctggg gataagtgcc ctgcggtatt 
gaggggcgcg atccttgaca cttgaggggc 
tgacatttga ggggctgtcc acaggcagaa 
ttttcggcca ccgctaacct gtcttttaac 
tttttaacca gggctgcgcc ctgtgcgcgt 
cttctcgaac cctcccggcc cgctaacgcg 
tcggccgcga acggcctcac cccaaaaatg 
atcggggcag taacgggatg ggcgatcagc 
ccgcaggtgc tggcatcgac attcagcgac 
ggtggcggcc tgcccttcac ttcggccgtc 
gcaattttta ccttgggcat tcttggcata 
ggtgcgataa acccagcgaa ccatttgagg 
acgagaattg gacctttaca gaattactct 
acgaagagga tgaagaggat gaggaggcag 



82/357 

cggccgtcta tggccctgca aacgcgccag 
gcggccgccg gcgttgtgga tacctcgcgg 
cggacgttga cacttgaggg gccgactcac 
ctcgatttcg gccggcgacg tggagctggc 
tttacgcgag t.ttcccacag atgatgtgga 
gacacttgag gggcgcgact actgacagat 
agagtgctga cagatgaggg gcgcacctat 
aatccagcat ttgcaagggt ttccgcccgt 
ctgcttttaa accaatattt ataaaccttg 
gaccgcgcac gccgaagggg ggtgcccccc 
ggcctcccat ccccccaggg gctgcgcccc 
gcagcgctgg cagtccttgc cattgccggg 
ccgagcgcga cgcccggaag cattgacgtg 
caggtgccgg gcagtgaggg cggcggcctg 
ggggcattca cggacttcat ggcggggccg 
gtggtcgcgg gtgccgtgct cgtgttcggg 
tgataggtaa gattataccg aggtatgaaa 
atgaagcgcc atatttaaaa agctaccaag 
attgccttga atatattgac aatactgata 
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agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcc cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 
tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 
ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 
aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 
aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 
ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 
aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 
taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 
tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 
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tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 
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tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcga<gccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccg<gtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg- 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 
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tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 
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taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg '6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 714 0 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 
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aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttg'actaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat -gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 



BASF NAE 878/03 90/357 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt aatctataca 10800 

atgctccata gactcacatt gatattgtcg aagatttcga tgctgactta gtagagcaac 10860 

tacaaaagtt agcagagaag catgatttct taatctttga agaccgcaag tttgcagata 10920 

tcggtatgtg aattctatct attttttttc tgatgtgtgc atggatgact catgatcata 10980 

ttcttaggta atactgtcaa gcatcaatat ggcaagggcg tttacaagat tgcttcttgg 11040 

tctcatatta ctaatgctca cacagttcct ggagaaggta ttatcaaggg acttgccgaa 11100 

gtcggcctcc ctcttggtcg tggcttgctt ttgctagcag aaatgtcatc tcaaggtgca 11160 

ttaactaagg gtatttacac tgccgaatct gtcaatatgg ctcgccgcaa caaagatttc 11220 
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gtttttggct ttattgcaca acacaaaatg aatcagtatg atgatgagga ttttgttgtc 11280 

atgtcgcctg aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc 11340 

cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct 11400 

aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa 114 60 

acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta 11520 

ttgggccaaa gacaaaaggg cgacattcaa ccgattgagg gagggaaggt aaatattgac 11580 

ggaaattatt cattaaaggt gaattatcac cgtcaccgac ttgagccatt tgggaattag 11640 

agccagcaaa atcaccagta gcaccattac cattagcaag gccggaaacg tcaccaatga 11700 

aaccatcgat agcagcaccg taatcagtag cgacagaatc aagtttgcct ttagcgtcag 11760 

actgtagcgc gttttcatcg gcattttcgg tcatagcccc cttattagcg tttgccatct 11820 

tttcataatc aaaatcaccg gaaccagagc caccaccgga accgcctccc tcagagccgc 11880 

caccctcaga accgccaccc tcagagccac caccctcaga gccgccacca gaaccaccac 11940 

cagagccgcc gccagcattg acaggaggcc cgatctagta acatagatga caccgcgcgc 12000 

gataatttat cctagtttgc gcgctatatt ttgttttcta tcgcgtatta aatgtataat 12060 

tgcgggactc taatcataaa aacccatctc ataaataacg tcatgcatta catgttaatt 12120 

attacatgct taacgtaatt caacagaaat tatatgataa tcatcgcaag accggcaaca 12180 

ggattcaatc ttaagaaact ttattgccaa atgtttgaac gatcggggat catccgggtc 12240 

tgtggcggga actccacgaa aatatccgaa cgcagcaaga tatcgcggtg catctcggtc 12300 

ttgcctgggc agtcgccgcc gacgccgttg atgtggacgc cgggcccgat catattgtcg 12360 

ctcaggatcg tggcgttgtg cttgtcggcc gttgctgtcg taatgatatc ggcaccttcg 12420 

accgcctgtt ccgcagagat cccgtgggcg aagaactcca gcatgagatc cccgcgctgg 12480 
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aggatcatcc agccggcgtc ccggaaaacg attccgaagc ccaacctttc atagaaggcg 12540 

gcggtggaat cgaaatctcg tgatggcagg ttgggcgtcg cttggtcggt catttcgaac 12600 

cccagagtcc cgctcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat 12660 

cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt 12720 

cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccggc 12780 

cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat 12840 

cgccatgggt cacgacgaga tcatcgccgt cgggcatgcg cgccttgagc ctggcgaaca 12900 

gttcggctgg cgcgagcccc tgatgctctt cgtccagatc atcctgatcg acaagaccgg 12960 

cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg aatgggcagg 13020 

tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat actttctcgg 13080 

caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat agcagccagt 13140 

cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc gtcgtggcca 13200 

gccacgatag ccgcgctgcc tcgtcctgca gttcattcag ggcaccggac aggtcggtct 13260 

tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca tcagagcagc 13320 

cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg gccggagaac 13380 

ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc agatccggtg cagattattt 13440 

ggattgagag tgaatatgag actctaattg gataccgagg ggaatttatg gaacgtcagt 13500 

ggagcatttt tgacaagaaa tatttgctag ctgatagtga ccttaggcga cttttgaacg 13560 

cgcaataatg gtttctgacg tatgtgctta gctcattaaa ctccagaaac ccgcggctga 13620 

gtggctcctt caacgttgcg gttctgtcag ttccaaacgt aaaacggctt gtcccgcgtc 13680 

atcggcgggg gtcataacgt gactccctta attctccgct catgatcaga ttgtcgtttc 13740 
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ccgccttcag tttaaactat cagtgtttga caggatatat tggcgggtaa acctaagaga 13800 

aaagagcgtt tattagaata atcggatatt taaaagggcg tgaaaaggtt tatccgttcg 13860 

tccatttgta tgtgcatgcc aaccacaggg ttccccagat ctggcgccgg ccagcgagac 13920 

gagcaagatt ggccgccgcc cgaaacgatc cgacagcgcg cccagcacag gtgcgcaggc 13980 

aaattgcacc aacgcataca gcgccagcag aatgccatag tgggcggtga cgtcgttcga 14040 

gtgaaccaga tcgcgcagga ggcccggcag caccggcata atcaggccga tgccgacagc 14100 

gtcgagcgcg acagtgctca gaattacgat caggggtatg ttgggtttca cgtctggcct 14160 

ccggaccagc ctccgctggt ccgattgaac gcgcggattc tttatcactg ataagttggt 14220 

ggacatatta tgtttatcag tgataaagtg tcaagcatga caaagttgca gccgaataca 14280 

gtgatccgtg ccgccctgga cctgttgaac gaggtcggcg tagacggtct gacgacacgc 14340 

aaactggcgg aacggttggg ggttcagcag ccggcgcttt actggcactt caggaacaag 14400 

cgggcgctgc tcgacgcact ggccgaagcc atgctggcgg agaatcatac gcattcggtg 14 4 60 

ccgagagccg acgacgactg gcgctcattt ctgatcggga atgcccgcag cttcaggcag 14520 

gcgctgctcg cctaccgcga tggcgcgcgc atccatgccg gcacgcgacc gggcgcaccg 14580 

cagatggaaa cggccgacgc gcagcttcgc ttcctctgcg aggcgggttt ttcggccggg 14640 

gacgccgtca atgcgctgat gacaatcagc tacttcactg ttggggccgt gcttgaggag 14700 

caggccggcg acagcgatgc cggcgagcgc ggcggcaccg ttgaacaggc tccgctctcg 14760 

ccgctgttgc gggccgcgat agacgccttc gacgaagccg gtccggacgc agcgttcgag 14820 

cagggactcg cggtgattgt cgatggattg gcgaaaagga ggctcgttgt caggaacgtt 14880 

gaaggaccga gaaagggtga cgattgatca ggaccgctgc cggagcgcaa cccactcact 14 940 

acagcagagc catgtagaca acatcccctc cccctttcca ccgcgtcaga cgcccgtagc 15000 
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agcccgctac gggctttttc atgccctgcc ctagcgtcca agcctcacgg ccgcgctcgg 15060 

cctctctggc ggccttctgg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg 15120 

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 15180 

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 15240 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 15300 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 15360 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 15420 

tgtccgcctt tctcccttcg ggaagcgtgg cgcttttccg ctgcataacc ctgcttcggg 15480 

gtcattatag cgattttttc ggtatatcca tcctttttcg cacgatatac aggattttgc 15540 

caaagggttc gtgtagactt tccttggtgt atccaacggc gtcagccggg caggataggt 15600 

gaagtaggcc cacccgcgag cgggtgttcc ttcttcactg tcccttattc gcacctggcg 15660 

gtgctcaacg ggaatcctgc tctgcgaggc tggccggcta ccgccggcgt aacagatgag 15720 

ggcaagcgga tggctgatga aaccaagcca accaggaagg gcagcccacc tatcaaggtg 15780 

tactgccttc cagacgaacg aagagcgatt gaggaaaagg cggcggcggc cggcatgagc 1584 0 

ctgtcggcct acctgctggc cgtcggccag ggctacaaaa tcacgggcgt cgtggactat 15900 

gagcacgtcc gcgagctggc ccgcatcaat ggcgacctgg gccgcctggg cggcctgctg 15960 

aaactctggc tcaccgacga cccgcgcacg gcgcggttcg gtgatgccac gatcctcgcc 16020 

ctgctggcga agatcgaaga gaagcaggac gagcttggca aggtcatgat gggcgtggtc 16080 

cgcccgaggg cagagccatg acttttttag ccgctaaaac ggccgggggg tgcgcgtgat 16140 

tgccaagcac gtccccatgc gctccatcaa gaagagcgac ttcgcggagc tggtgaagta 16200 

catcaccgac gagcaaggca agaccgagcg cctttgcgac gctca 16245 
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misc_f eature 
(10264) . . (10264) 
n is a, c, g, or t 

misc_feature 
(10472) . . (10472) 
n is a, c, g, or t 

misc_f eature 
(10563) . . (10563) 
n is a, c, g, or t 



ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 



aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 



120 



aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 



180 



ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 



240 



cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 



300 



caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 



360 



gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 



420 



tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 
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ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 
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tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagpacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 
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cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctatitttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

r 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 
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gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4 8 60 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 
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gccaatcccg atgcctacag caaccaatgt tctcggcctg gcgtggctcc gcctgatcgg 5520 

agcgggttra acctacttcc tttggttccg ggggatctcg cgactccaac ctacagttgt 55S0 

ttccttactg ggctttctca gccccagatc tggggtccat cagccgggga tgcatcaggc 5640 

cgacagncgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaaiga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 
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tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tqaattcgag 6730 

ctcggraccc ggggatcttt cgacactgaa atacgtcgag ccigctccgc ttggaagcgg 6-340 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc acttccccca 6300 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

guacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacc tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctitc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 
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tctcaccttc 
gcccatcaai 
accgcaagga 
ccat gtgTiat 
t'ctcga tgag 
ggatttcggc 
gagcgaggcg 
gtggttggct 

aggatcgccg 
cttggttgac 
ccgatccgga 
gaccgatggc 
gagggcaaag 
tcatcaaaca 
ttgagacaaa 
gtgccttcta 
cctcttccag 
cttncaggct 
gagatcaagc 
tccacgcgac 



caagacctgc 
gcgatcgctc 
at cggt caat 
cactggcaaa 
ctgatgcttt 
tccaacaatg 
a tgt t cgggg 
tgtatggagc 

cggctccggg 
ggcaatttcg 
gccgggactg 
tgtgtagaag 
gaatagagta 
gcttgacgaa 
tggtgttcag 
gtgatttaat 
atacagctca 
ccggcgaaga 
agatcaacgg 
tatatatttg 



ctgaaaccga 
cggccgatct 
acactacatg 
ctgtgatgga 
gggccgagga 
tcctgacgga 
attcccaata 
agcagacgcg 

cgtatatgct 
atgatgcagc 
tcgggcgtac 
tactcgccga 
gatgccgacc 
tctggatata 
gatctcgata 
agctccatgt 
tctgcaatgc 
gaagaatagc 
tcgtcaagag 
tctctaattg 



actgcccgct 
tagccagacg 
gcgt gart tc 
cgacaccgtc 
ctgccccgaa 
caatggccgc 
cgaggtcgcc 
ctacttcgag 

ccgcattggt 
ttgggcgcag 
acaaatcgcc 
tagtggaaac 
gcgggatcga 
agatcgttgg 
agatacgttc 
caacaagaat 
attaatgcat 
ttagcagagc 
acctacgaga 
tactttgaca 



gttctgcagc 
agcgggttcg 
atatgcgcga 
agtgcgtccg 
gtccggcacc 
ataacagcgg 
aacatcttct 
cggaggcatc 

cttgaccaac 
ggtcgatgcg 
cgcagaagcg 
cgacgcccca 
tccacttaac 
tgtcgatgtc 
atttgtccaa 
aaaacgcgtt 
tgactgcaac 
tattttcatt 
ctgaggaatc 
tgctcctctt 



ccgtcgcgga 
gcccat rcgg 
t.tgctgarcc 
tcgcgcaggc 
tcgtgcacgc 
tcattgactg 
tctggaggcc 
cggagcttgc 

tctatcagag 
acgcaatcgt 
cggccgtctg 
gcactcgtcc 
gttactgaaa 
agctccggag 
gcagcaaaga 
ttcgggttta 
ctagtaacgc 
ttcgggagac 
cgctcttggc 
ctttactctg 



9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 

9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
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ataccttgac tatgaaaatt ccgtcaccac cncctgggtt cgcaaagata attgcatctt: 10500 

tcttccttga ccictcaaoc ctacaggaca cacattcatc giagcia taa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaaigctccc 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc .ctatgactcg tttacccaga 106S0 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtacT: 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ttttcgagtt 10800 

tttttttttt ttctttgrga aggatttatt gttattggta tccatttttt atrggaagac 10860 

aagataagtt aatattgatt ttgcttaaag attaaaagga aatcagaaaa cgacaataaa 10920 

aaatgtaacg gacaaactat ggtgtcgatt ataagtctaa atccttaaaa aatgacaacg 10980 

agttgctttc ctctgaaaac aattcttttg tctttgcaag aaaggtttct tttttgtttg 11040 

cttgcattac ttaaacatca aatcaaatga aaggaataaa gcagatttga gggcgaataa 11100 

ggattttctg gtcaacaaga tgtgagtgac acctaaggaa ctaaatgcca ttcatttgtt 11160 

ttaaaacgac atcaaagatt gatgatcaac aggattgaga gagagaaaaa gaactcgtgt 11220 

catttatttc tgttgactga aattttatat ttagaaaaaa tgtcaaatct atagctttag 11280 

ctatattaca taacatttga aataataata ataaaaaaag acacattaga gacacttttc 11340 

aaactctaaa taactgtcta taaacacaaa gaaaacaaag acctctataa caacttatta 11400 

gatttttctc gtacttttgt ctaaagatga tgtattcttg ttatcccaca cttctttcat 11460 

ttgttcttga tgctactaaa tatacaaaat ttcttttttg caagagatat tattccaaaa 11520 

attttcaaaa agaaattttt ttcacaatag cagttgatcg tgtaacccaa agaggttctt 11580 

tgttattttg cacttccgct ttgcggtgat gcatattcaa agtaatatat ggaataaaca 11640 

acgtgtttaa gcatgaaaga aaggaaacaa aggccgcttt gaacaaatgc ataatatttc 11700 
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agacaaaaat: cazazaaagc aaccagtaaa tcaaacaaga aacattgctg attcccgtta 1 1 7 60 

gaaaacgata aaagictaat aagccactaa gtatacttca atcaact"tt rgtatgcttia IIS20 

rggtccaatc agaccaataa tttgtgacca ttcctgaggt ggctttggtg atgcggaaac 11880 

agaaaaaaat ttictcacca atcgatttaa aaaacaattt ctgctttgaa ccaaaacut 11940 

ttttttctct ttaatcatta actttatcaa gtatgtacct accctcaaag tcctcactca 12000 

agcacaatta tgctaacatt gttccacctt ctctttagaa atgctgtcga agctgcagtc 12060 

aatcagcgtc aaggcccgcc gcgttgaact agcccgcgac atcacgcggc ccaaagtctg 12120 

cctgcatgct cagcggtgct cgttagttcg gctgcgagtg gcagcaccac agacagag.ga 12180 

ggcgctggga accgtgcagg ctgccggcgc gggcgatgag cacagcgccg atgtagcact 12240 

ccagcagctt gaccgggcta tcgcagagcg tcgtgcccgg cgcaaacggg agcagctgtc 12300 

ataccaggct gccgccattg cagcatcaat tggcgtgtca ggcattgcca tcttcgccac -12360 

ctacctgaga tttgccatgc acatgaccgt gggcggcgca gtgccatggg gtgaagtggc 12420 

tggcactctc ctcttggtgg ttggtggcgc gctcggcatg gagatgtatg cccgctatgc 12480 

acacaaagcc atctggcatg agtcgcctct gggctggctg ctgcacaaga gccaccacac 12540 

acctcgcact ggaccctttg aagccaacga cttgtttgca atcatcaatg gactgcccgc 12600 

catgctcctg tgtacctttg gcttctggct gcccaacgtc ctgggggcgg cctgctttgg 12660 

agcggggctg ggcatcacgc tatacggcat ggcatatatg tttgtacacg atggcctggt 12720 

gcacaggcgc tttcccaccg ggcccatcgc tggcctgccc tacatgaagc gcctgacagt 12780 

ggcccaccag ctacaccaca gcggcaagta cggtggcgcg ccctggggta tgttcttggg 12840 

tccacaggag ctgcagcaca ttccaggtgc ggcggaggag gtggagcgac tggtcctgga 12900 

actggactgg tccaagcggt agaagcttgg cgtaatcatg gtcatagctg tttcctgtgt 12960 
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gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtctaaac 13020 

cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgcrt I30SO 

tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 13140 

gcggtttgcg tattgggcca aagacaaaag ggcgacattc aaccgattga gggagggaag 13200 

gtaaatattg acggaaatta ttcattaaag gtgaattatc accgtcaccg acttgagcca 13260 

tttgggaatt agagccagca aaatcaccag tagcaccatt accattagca aggccggaaa 13320 

cgtcaccaat gaaaccatcg atagcagcac cgtaatcagt agcgacagaa tcaagtttgc 13380 

crttagcgtc agactgtagc gcgttttcat cggcattttc ggtcatagcc cccttattag 13440 

cgtttgccat cttttcataa tcaaaatcac cggaaccaga gccaccaccg gaaccgcctc 13500 

cctcagagcc gccaccctca gaaccgccac cctcagagcc accaccctca gagccgccac 13560 

cagaaccacc 'accagagccg ccgccagcat tgacaggagg cccgatctag taacatagat 13620 

gacaccgcgc gcgataattt atcctagttt gcgcgctata ttttgttttc tatcgcgtat 13680 

taaatgtata attgcgggac tctaatcata aaaacccatc tcataaataa cgtcatgcat 13740 

tacatgttaa ttattacatg cttaacgtaa ttcaacagaa attatatgat aatcatcgca 13800 

agaccggcaa caggattcaa tcttaagaaa ctttattgcc aaatgtttga acgatcgggg 13860 

atcatccggg tctgtggcgg gaactccacg aaaatatccg aacgcagcaa gatatcgcgg 13920 

tgcatctcgg tcttgcctgg gcagtcgccg ccgacgccgt tgatgtggac gccgggcccg 13980 

atcatattgt cgctcaggat cgtggcgttg tgcttgtcgg ccgttgctgt cgtaatgata 14040 

tcggcacctt cgaccgcctg ttccgcagag atcccgtggg cgaagaactc cagcatgaga 14100 

tccccgcgct ggaggatcat ccagccggcg tcccggaaaa cgattccgaa gcccaacctt 14160 

tcatagaagg cggcggtgga atcgaaatct cgtgatggca ggttgggcgt cgcttggtcg 14220 
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gtcatttcga 

tgcgctgcga 

cgccaagctc 

cacccagccg 

gcaagcaggc 

gcctggcgaa 

cgacaagacc 

cgaat gggca 

atactttctc 

atagcagcca 

ccgtcgtggc 

acaggtcggt 

catcagagca 

cggccggaga 

tgcagattat 

tggaacgtca 

gact tttgaa 

acccgcggct 

ttgtcccgcg 

gattgtcgtt 

aaacctaaga 



accccagagt 
atcgggagcg 
t t cagcaa t a 
gccacacicg 
atcgccatgg 
cagttcggct 
ggcttccatc 
ggtagccgga 
ggcaggagca 
gtcccttccc 
cagccacgat 
cttgacaaaa 
gccgattgtc 
acctgcgtgc 
ttggattgag 
gtggagcatt 
cgcgcaataa 
gagtggctcc 
-tcatcggcgg 
tcccgccttc 
gaaaagagcg 



cccgctcaga 

gcgataccgt 

tcacgggtag 

atgaatccag 

gtcacgacga 

ggcgcgagcc 

cgagtacgtg 

tcaagcgtat 

aggtgagatg 

gcttcagtga 

agccgcgctg 

agaaccgggc 

tgttgtgccc 

aatccatctt 

agtgaatatg 

tttgacaaga 

tggtttctga 

ttcaacgttg 

gggtcataac 

agtttaaact 

tttattagaa 



agaactcgtc 
aaagcacgag 
ccaacgctat 
aaaagcggcc 
gatcatcgcc 
cctgatgctc 
ctcgctcgat 
gcagccgccg 
acaggagatc 
caacgtcgag 
cctcgtcctg 
gcccctgcgc 
agtcatagcc 
gttcaatcat 
agactctaat 
aatatttgct 
cgtatgtgct 
cggttctgtc 
gtgactccct 
atcagtgttt 
taatcggata 



aagaaggcga 
gaagcggtca 
gtcctgatag 
attttccacc 
gtcgggcatg 
ttcgtccaga 
gcgatgtttc 
cattgcatca 
ctgccccggc 
cacagctgcg 
cagttcattc 
tgacagccgg 
gaatagcctc 
gcgaaacgat 
tggataccga 
agctgatagt 
tagctcatta 
agttccaaac 
taattctccg 
gacaggatat 
tttaaaaggg 



tagaaggcga 
gcccattcgc 
cggtccgcca 
atgat artcg 
cgcgccttga 
tcatcctgat 
gcttggtggt 
gccat gatgg 
acttcgccca 
caaggaacgc 
agggcaccgg 
aacacggcgg 
tccacccaag 
ccagatccgg 
ggggaattta 
gaccttaggc 
aactccagaa 
gtaaaacggc 
ctcatgatca 
attggcgggt 
cgtgaaaagg 



I42SC 
14340 
14400 
144 60 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
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tttatccgtt cg-ccatttg tatgtgcatg ccaaccacag ggttccccag atctggcgcc 15540 

ggccagcgag acgagcaaga ttggccgccg cccgaaacga tccgacagcg cgcccagcac 15600 

acgtgcgcag gcaaattgca ccaacgcata cagcgccagc agaatgccai agtgggcggt 15650 

gacgtcgttc gagtgaacca gatcgcgcag gaggcccggc agcaccggca taatcaggcc 15720 

gatgccgaca gcgtcgagcg cgacagtgct cagaattacg atcaggggta tgttgggttt 15780 

cacgtctggc ctccggacca gcctccgctg gtccgattga acgcgcggat tctttatcac 15S40 

tgataagttg gtggacatat tatgtttatc agtgataaag tgtcaagcat gacaaagttg 15900 

cagccgaata cagtgatccg tgccgccctg gacctgttga acgaggtcgg .cgtagacggt 15960 

ctgacgacac gcaaactggc ggaacggttg ggggttcagc agccggcgct ttactggcac 16020 

ttcaggaaca agcgggcgct gctcgacgca ctggccgaag ccatgctggc ggagaatcat 16080 

acgcattcgg tgccgagagc cgacgacgac tggcgctcat ttctgatcgg gaatgcccgc 16140 

agcttcaggc aggcgctgct cgcctaccgc gatggcgcgc gcatccatgc cggcacgcga 16200 

ccgggcgcac cgcagatgga aacggccgac gcgcagcttc gcttcctctg cgaggcgggt 16260 

ttttcggccg gggacgccgt caatgcgctg atgacaatca gctacttcac tgttggggcc 16320 

gtgcttgagg agcaggccgg cgacagcgat gccggcgagc gcggcggcac cgttgaacag 16380 

gctccgctct cgccgctgtt gcgggccgcg atagacgcct tcgacgaagc cggtccggac 16440 

gcagcgttcg agcagggact cgcggtgatt gtcgatggat tggcgaaaag gaggctcgtt 16500 

gtcaggaacg ttgaaggacc gagaaagggt gacgattgat caggaccgct gccggagcgc 16560 

aacccactca ctacagcaga gccatgtaga caacatcccc tccccctttc caccgcgtca 16620 

gacgcccgta gcagcccgct acgggctttt tcatgccctg ccctagcgtc caagcctcac 16680 

ggccgcgctc ggcctctctg gcggccttct ggcgctcttc cgcttcctcg ctcactgact 16740 
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cgctgcccic gctccttcgg ctgcggcgag cggtatcagc icacrcaaag qccgtaatac 16800 

ggttatccac agaarcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 16860 

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccatacgctc cccccccctg 16920 

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 16980 

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 17040 

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgcttttc cgctgcataa 17100 

cccrgcttcg gggtcattat agcgattttt tcggtatatc catccttttt cgcacgatat 17160 

acaggatttt gccaaagggt tcgtgtagac tttccttggt gtatccaacg gcgtcagccg 17220 

ggcaggatag gtgaagtagg cccacccgcg agcgggtgtt ccttcttcac tgtcccttat 17280 

tcgcacctgg cggtgctca'a cgggaatcct gctctgcgag gctggccggc taccgccggc 17340 

gtaacagatg agggcaagcg gatggctgat gaaaccaagc caaccaggaa gggcagccca 17400 

cctatcaagg tgtactgcct tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg 17460 

gccggcatga gcctgtcggc ctacctgctg gccgtcggcc agggctacaa aatcacgggc 17520 

gtcgtggact ■ atgagcacgt ccgcgagctg gcccgcatca atggcgacct gggccgcctg 17580 

ggcggcctgc tgaaactctg gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc 17640 

acgatcctcg ccctgctggc gaagatcgaa gagaagcagg acgagcttgg caaggtcatg 17700 

atgggcgtgg tccgcccgag ggcagagcca tgactttttt agccgctaaa acggccgggg 17760 

ggtgcgcgtg attgccaagc acgtccccat gcgctccatc aagaagagcg acttcgcgga 17820 

gctggtgaag tacatcaccg acgagcaagg caagaccgag cgcctttgcg acgctca 17877 

<210> 38 
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<2I1> 17238 

<212> DNA 

<2i3> Artificial 

<220> 

<223> Plasmic 
<220> 

<221> misc_f eature 

<222> (10264 ) . . (10264) 

<223> n- is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 



<220> 

<221> misc_feature 
<222> (10563) . . (10563) 
<223> n is a, c, g, or t 

<400> 38 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 



BASF NAE 878/03 111 /357 

ttrtiaacca gggctccgcc ctgtgcgcgt gaccgcgcac gccgaagggg cgtgcccccc 600 

cttctccaac cc" cccggcc cgctaacgcg gqcciccca: ccccccaggg cctccccccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtcctrgc cartgccggg 720 

atcgggccag ~ aacgggatg ggcgatcagc ccgagcgcga cgcccggaag ca-cgacgtg 730 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

gctgccggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc • 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg . attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 
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attgttttac tacctagatg tggcgcaacg atgccggcga caagcaggac cccaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagiai zzcqgcaaqc 3 ISO 

ggtcgctggt attcgtgcag ggcaagattc ggaaiaccaa gtacgagaag gacgcccaga 3240 

cgctctacgc gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc aagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 
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qcczcazczc 



cctgcgaaga 



tgcattgcaa 



ctttactggc 



gctcgggacg 



tgtgat taag 



ccgattgtcg 



gaaaaagccc 



catcgacggc 



tcacaaggcg 



cggtatgctg 
tccaacggga 



ctggagcttg 



cgctgtgcag 



attgatggcg 



accaaacgca 



ggcgttcgga 



cgcctggcaa 



gccaatcccg 



agcgggttta 



ctatttcagc 

cagatcggat 

grrgcgaggc 

acgctagggc 

atttcaggaa 

cacggcgcgc 

get cagatt c 

gecctgaaga 

atggaggcgt 

gagatcattg 

catctgtccg 

ctgcgggcgt 

atctggtgga 

ttgtttattt 

ccgctgatgg 

gtcctggggg 

gegctagate 

accgtgctga 

ctggcggccg 

atgcctacag 

acctacttcc 



tcgcaccggg 
tccacccgcg 
ageggectgg 
cttgtggggt 
caagegggea 
tctacgaact 
gaeggcttgg 
aagctccaga 
tegctgaacg 
ggctgtcggt 
gcgttttcgt 
tgccggcggg 
tgegcatett 
cggtctaccg 
tcgtgttcat 
etatttgegg 
ctgtcggcgt 
cccgcaagtg 
gaggacttct 
gaaccaatgt 
tttggttccg 



agccgtaccc 
tgaagaagtg 
tggaacaege 
cagttccggc 
ctgctcgacg 
gecgataaac 
agcggccgac 
gatgttcggg 
gttgegagat 
cttcaaacag 
ggagcccgaa 
tttattgetc 
catcctcggc 
cctgccgggc 
ctctgccgct 
aactgeggge 
cgcagcgggc 
gcaacctccc 
gctcgttcca 
tctcggcctg 
ggggatctcg 



gctcaagctg 
gcgcgagcac 
cigggtcaai 
tgggggttca 
cacttgcttc 
agaggattaa 
gtgeaggatt 
tccgtttacg 
gccgtggcat 
gaggaeggee 
cagcgaggcc 
gtgatgatcg 
gcacttaata 
ggggtcgegg 
ctgctaggta 
gtggcgctgt 
ctggcggggg 
gtgcctctgc 
gtagctttag 
gcgtggctcg 
cgactcgaac 



caaaccttcc 
gucggcgaag 
gatgacc zcg 
gcagccagcg 
gctcagtatc 
aattgacaat 
tecgegagat 
agcacgagga 
tcggcgccta 
ccaaggacgc 
gaggggtege 
tccgacagat 
tttegctatt 
egaeggtagg 
gcccgatacg 
tggtgttgac 
eggtttccat 
tcacctttac 
tgtttgatcc 
gectgategg 
ctacagttgt 



4380 

4440 

4500 

4560 

4620 • 

4680 

4 74 0 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 
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ttccttactc ggct ttctca gccccagatc tggggtcgat cagccgcgga tgcatcaggc 5640 

cgacagrcgg aacttcgggt ccccgacctg taccattcgg tgagcaa-cgg anaggggagt 5700 

tcataicctc aacgttcact tctaaagaaa ragcgccact cagctrcctc agcggctrca 576C 

tccagcgatr tcctattatg ucggcat agt tctcaagatc gacagcctgt cacggttaag 5S20 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ' ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 
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ctcggtaccc gcggatcttt cgacactgaa atacctcgag cctgcuccgc t tggaagcgc 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac garaagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacitcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct . gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta.gctcaccaca aaagtcaga"c~gigcgt aacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 
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accatttgt'a gaaaaatctg acqaactcgt gagctctgta cagtgaccgg tcacrctttc SIOO 

■ cccatgccc agagacgcac ggacgcagag agaagcgctg agtaataagc cactccccac 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc S220 

cgccccgaag t-ggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcaicgg S2S0 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 
tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca • 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 
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ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg ccccaticgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc a-iatgcgcga ttgctcatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggcr tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcag.ag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg . cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 
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canttcctac taagatggta tacaatagta accatgcatg gttgcctagt caazcczccc 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga I06S0 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ctaccgcttg 10S00 

gaccagtcca gtrccaggac cagtcgctcc acctcctccg ccgcacctgg aatgrgcrgc 10860 

agctcctgtg gacccaagaa cataccccag ggcgcgccac cgtacttgcc gctgtggtgt 10920 

agctggtggg ccactgtcag gcgcttcatg tagggcaggc cagcgatggg cccggtggga 10980 

aagcgcctgt gcaccaggcc atcgtgtaca aacatatatg ccatgccgta tagcgtgatg 11040 

cccagccccg ctccaaagca ggccgccccc aggacgttgg gcagccagaa gccaaaggta 11100 

cacaggagca tggcgggcag tccattgatg attgcaaaca agtcgttggc ttcaa'agggt 11160 

ccagtgcgag gtgtgtggtg gctcttgtgc agcagccagc ccagaggcga ctcatgccag 11220 

atggctttgt gtgcatagcg ggcatacatc tccatgccga gcgcgccacc aaccaccaag 11280 

aggagagtgc cagccacttc accccatggc actgcgccgc ccacggtcat gtgcatggca 11340 

aatctcaggt aggtggcgaa gatggcaatg cctgacacgc caattgatgc tgcaatggcg 11400 

gcagcctggt atgacagctg ctcccgtttg cgccgggcac gacgctctgc gatagcccgg 114 60 

tcaagctgct ggagtgctac atcggcgctg tgctcatcgc ccgcgccggc agcctgcacg 11520 

gttcccagcg cctcctctgt ctgtggtgct gccactcgca gccgaactaa cgagcaccgc 11580 

tgagcatgca ggcagacttt gggccgcgtg atgtcgcggg ctagttcaac gcggcgggcc 11640 

ttgacgctga ttgactgcag cttcgacagc atagagataa aataaaaaga gaagaaaaga 11700 

aagtttgtac aatttctttt tgtttatata acatacacgc tatgtcaaca tttagaataa 11760 

gggggaaaaa atcttccatc atattcgaat gcacaagatt atttctttgt tcgctctttt 11820 
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tggtcgggtc atcgagattt agagtgtaat caaagatact gtcatctcga gagcgttgca IISSO 

caggctcctg ttrgccaaat tggatgtttg ccgaattagt aaaatacgca agcatttctt 11940 

acctttccgc tcccttttcc taattctccc aaagactaaa tgaggaaaga taaaggacaa 12000 

agaaaatgta aagacaaaga aattgaaaac gatataaact tgcagcacgt aagaccaaag 12060 

caaattggta actattcttg tgtacaaaca tgtataaaaa aaaacttttt tttgctcctg 12120 

gaggacaaaa tttcaaactc cttgaagaag attgcttgta tatctatcat atgcatatat 12180 

catatcgatg gaaaaagaaa gtcaggcatg tatttataaa aagaagaatg tgccatgctt 12240 

ccgaatttct tttcactttc ttttccttat ctattttaat ctcaagcttg gcgtaatcat 12300 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 12360 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 12420 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 12480- 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcc aaagacaaaa gggcgacatt 12540 

caaccgattg agggagggaa ggtaaatatt gacggaaatt attcattaaa ggtgaattat 12600 

caccgtcacc gacttgagcc atttgggaat tagagccagc aaaatcacca gtagcaccat 12660 

taccattagc aaggccggaa acgtcaccaa tgaaaccatc gatagcagca ccgtaatcag 12.720 

tagcgacaga atcaagtttg cctttagcgt cagactgtag cgcgttttca tcggcatttt 12780 

cggtcatagc ccccttatta gcgtttgcca tcttttcata atcaaaatca ccggaaccag 12840 

agccaccacc ggaaccgcct ccctcagagc cgccaccctc agaaccgcca ccctcagagc 12900 

caccaccctc agagccgcca ccagaaccac caccagagcc gccgccagca ttgacaggag 12960 

gcccgatcta gtaacataga tgacaccgcg cgcgataatt tatcctagtt tgcgcgctat 13020 

attttgtttt ctatcgcgta ttaaatgtat aattgcggga ctctaatcat aaaaacccat 13080 



BASF NAE 878/03 

ctcataaata acgtcatgca ttacatgtta 
aattatatga taatcatccc aagaccgcca 
caaatgtttg aacgatcggg gatcatccgg 
gaacgcagca agatatcgcg gtgcatctcg 
ttgatgtgga cgccgggccc gatcatattg 
gccgttgctg tcgtaatgat atcggcacct 
gcgaagaact ccagcatgag atccccgcgc 
acgattccga agcccaacct ttcatagaag 
aggttgggcg tcgcttggtc ggtcatttcg 
caagaaggcg atagaaggcg atgcgctgcg 
ggaagcggtc agcccattcg ccgccaagct 
tgtcctgata gcggtccgcc acacccagcc 
cattttccac catgatattc ggcaagcagg 
cgtcgggcat gcgcgccttg agcctggcga 
cttcgtccag atcatcctga tcgacaagac 
tgcgatgttt cgcttggtgg tcgaatgggc 
gcattgcatc agccatgatg gatactttct 
cctgccccgg cacttcgccc aatagcagcc 
gcacagctgc gcaaggaacg cccgtcgtgg 
gcagttcatt cagggcaccg gacaggtcgg 
ctgacagccg gaacacggcg gcatcagagc 



121/357 

attattacat ccttaacgta attcaacaga 13140 

acaggattca atcttaagaa actttattcc 13200 

gtctgtggcg ggaactccac gaaaatatcc 13260 

gtcttgcctg ggcagtcgcc gccgacgccg 13320 

tcgctcagga tcgtggcgtt gtgcttgncg 13380 

tcgaccgcct gttccgcaga gatcccgtgg 13440 

tggaggatca tccagccggc gtcccggaaa 13500 

gcggcggtgg aatcgaaatc tcgtgatggc 13560 

aaccccagag tcccgctcag aagaactcgt 13620 

aatcgggagc ggcgataccg taaagcacga 13680 

cttcagcaat atcacgggta gccaacgcta 13740 

ggccacagtc gatgaatcca gaaaagcggc 13800 

catcgccatg ggtcacgacg agatcatcgc 13860 

acagttcggc tggcgcgagc ccctgatgct 13920 

cggcttccat ccgagtacgt gctcgctcga 13980 

aggtagccgg atcaagcgta tgcagccgcc 14040 

cggcaggagc aaggtgagat gacaggagat 14100 

agtcccttcc cgcttcagtg acaacgtcga 14160 

ccagccacga tagccgcgct gcctcgtcct 14220 

tcttgacaaa aagaaccggg cgcccctgcg 14280 

agccgattgt ctgttgtgcc cagtcatagc 14340 
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cgaatagcct ctccacccaa gcggccggag aacctgcgtc caatccatct tcttcaatca 14400 

tgcgaaacca rccagaiccg gtgcagatta tttggattga gagtgaatat gagactciaa 14460 

ttggataccg acgggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14520 

tagctganag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14580 

ttagctcatt aaactccaga aacccgcggc tgagtggctc ctrcaacgtt gcggttctgr 14640 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14700 

ttaattcrcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14760 

tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14820 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat gccaaccaca 14880 

gggttcccca gatctggcgc cggccag-ga gacgagcaag attggccgcc gcccgaaacg 14940 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 15000 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 15060 

cagcaccggc ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15120 

gatcaggggt atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg 15180 

aacgcgcgga -ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa 15240 

gtgtcaagca tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg 15300 

aacgaggtcg gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag 15360 

cagccggcgc tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa 15420 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15480 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15540 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 15600 



BASF NAE 

cgcticctct 
acctacttcs 
cgcggcggca 
ttcgacgaag 
1 1 ggcgaaaa 
tcaggaccgc 
ctcccccttt 
gccctagcgt 
ccgcttcctc 
ctcactcaaa 
tgtgagcaaa 
tccataggct 
gaaacccgac 
ctcctgttcc 
tggcgctt tt 
ccatcctttt 
tgtatccaac 
tccttcttca 
ggctggccgg 
ccaaccagga 
attgaggaaa 



878/03 

gcgaggcggg 
ctgttggggc 
ccgttgaaca 
ccggtccgga 
ggaggctcgt 
tgccggagcg 
ccaccgcgtc 
ccaagcctca 
gctcactgac 
ggcggtaata 
aggccagcaa 
ccgcccccct 
aggactataa 
gaccctgccg 
ccgctgcata 
tcgcacgata 
ggcgtcagcc 
ctgtccctta 
ctaccgccgg 
agggcagccc 
aggcggcggc 



tttttcggcc 
cgt get tgag 
ggctccgctc 
cgcagcgttc 
tgtcaggaac 
caacccactc 
agacgcccgt 
cggccgcgct 
tcgctgcgct 
eggttatcca 
aaggccagga 
gacgagcatc 
agataccagg 
ettaceggat 
accctgcttc 
tacaggattt 
gggcaggata 
ttcgcacctg 
cgtaacagat 
acctatcaag 
ggccggcatg 



123/3E 

ggggacgccg 

gagcaggecg 

tcgccgctgt 

gagcagggac . 

gttgaaggac 

actacagcag 

agcagcccgc 

cggcctctct 

eggt cgttcg 

cagaatcagg 

acegtaaaaa 

acaaaaatcg 

cgtttccccc 

acctgtccgc 

ggggtcatta 

tgccaaaggg 

ggtgaagtag 

gcggtgctca 

gagggcaagc 

gtgtactgcc 

agectgtegg 



teaatgeget 
gegacagega 
tgcgggccgc 
tegeggtgat 
cgagaaaggg 
agecatgtag 
tacgggcttt 
ggcggccttc 
getgeggega 
ggataacgea 
ggccgcgttg 
aegctcaagt 
tggaagctcc 
ctttctccct 
tagegatttt 
ttcgtgtaga 
gcccacccgc 
aegggaatec 
ggatggctga 
ttccagacga 
cctacctgct 



oatgacaatc 
tgeeggegag 
gatagacgee 
tgtcgatgga 
tgacgattga 
acaacatccc 
ttcatgccct 
tggegctett 
gcggtatcag 
ggaaagaaca 
ctggcgtttt 
cagaggtggc 
ctcgtgcgct 
tegggaageg 
tteggtatat 
ctttccttgg 
gagcgggtgt 
tgetctgega 
tgaaaccaag 
aegaagageg 
ggccgtcggc 



15 660 
15720 
15780 
I5S40 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
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cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccccgacct gccccgcatc 16920 

aatggcgacc tgggccgcct gggcggcctg ctgaaactct gcctcaccga cgacccgcgc 16980 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 17040 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 17100 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17160 

caagaagagc gacttcgcgg agctgg.tgaa gtacatcacc gacgagcaag gcaagaccga 17220 

gcgcctttgc gacgctca 17238 

39 

17238 
DNA 

Artificial 
Plasmid 



misc_f eature 
(10264 ) . . (10264 ) 
n is a, c, g, or t 

misc_f eature 
(10472) . . (10472) 
n is a, c, g, or t 

misc_f eature 
(10563) . . (10563) 
n is a, c, g, or t 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



<400> 39 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 
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eaccgccgtc gaagccgtgt gcgagacacc gcgccccccc gcgttgtgga cacctcccqc 120 

aaaacttggc ccrcactgac agatgagggg cggacgttca cacttgaggg ccccactcac ISO 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatigtgca 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgrg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 
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atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac Utgtcarqc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccac ctgcrgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctcattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

i 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa g'gaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 
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tcaagatgaa caaaqccctg aaaagattat cgagctctat gcgcagtcca tcaggctctt 2640 

tcaciccatc gaca-catcgg attgtcccta tacgaatagc tiagacaccc gcttaaccga 2700 

attggartac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactcca-.tt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga. 2S20 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2SS0 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 
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ccctgrtcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggicaiit 3900 

tccacgtcaa caagcacgtg aagatcacct acaccggcgt cgagcicccg gccgacgatg 3960 

acgaacrggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccqgtatt 40S0 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag .gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 
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ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggcct cccgc cgacggtacg 5160 

cgcrgtccag ccccigargo tcgtgttcat ctctgccgct ctgct accta gcccgatacg 5220 

atngatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggrgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtitccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 
tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta • 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 
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ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacct caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 64 8 0 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg arttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc .tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 
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tciciagrta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 76S0 

gtaaggartt cggcacggct acggaagacg gagaagccac cttcagtgqa ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccaicaaagt 7800 

cgtataccta ccagrgagga agtggactca aatcgacttc agcaacatct cctggataaa 7S60 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt cafgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 
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tttgaacctt ccagttcgag ctttcccact tcatcgcagc ttgactaaca gczaaccccc S 94 0 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt tuctgarcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgrgcttr 9060 

cagcttcgar graggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgaca agatacgttc atttgtccaa gcagcaaaga 10140 
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gtcccttcta gtgarttaat agctccatgt caacaagaat aaaacccgit ttcgcgtcta 10200 

cctcrtccac atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcati ttcgggagac 10320 

cagancaagc agatcaacgg tcgtcaagag acctacgaga .ctgaggaatc cgctcttgcc 103S0 

tccacgcgac tatatatttg tctctaattg tacrttgaca tgctcctctt ctrtactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt agagataaaa 10800 

taaaaagaga agaaaagaaa gtttgtacaa tttctttttg tttatataac atacacgcta 10860 

tgtcaacatt tagaataagg gggaaaaaat cttccatcat attcgaatgc acaagattat 10920 

ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca aagatactgt 10980 

catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc gaattagtaa 11040 

aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa agactaaatg 11100 

aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga tataaacttg 11160 

cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg tataaaaaaa 11220 

aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat tgcttgtata 11280 

tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta tttataaaaa 11340 

gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttccttatct attttaatct 11400 
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catgctgncg aacctgcagt caatcagcgt caaggcccgc cgcgtt.gaac tagcccgcga 11460 

catcacgcgc cccaaagtct gcctgcatgc tcagccgtgc tcgttagttc ggcrgcgagt I 1520 

ggcagcacca cagacagagg aggcgctggg aaccgtgcag gctgccggcg cgggcgatga 11580 

gcacagcgcc gatgtagcac tccagcagct tgaccgggct atcgcagagc gtcgtgcccg 11640 

gcgcaaacgg gagcagctgt cataccaggc tgccgccatt gcagcatcaa ttggcgtgtc 11700 

aggcattgcc atcttcgcca cctacctgag atttgccatg cacatgaccg tgggcggcgc 11760 

agtgccatgg ggtgaagtgg ctggcactct cctcttggtg gttggtggcg cgctcggcat 11820 

ggagatgtat gcccgctatg cacacaaagc catctggcat gagtcgcctc tgggctggct 11880 

gctgcacaag agccaccaca cacctcgcac tggacccttt gaagccaacg acttgtttgc 11940 

aatcatcaat ggactgcccg ccatgctcct gtgtaccttt ggcttctggc tgcccaacgt 12000 

cctgggggcg gcctgctttg gagcggggct gggcatcacg ctatacggca tggcatatat 12060 

gtttgtacac gatggcctgg tgcacaggcg ctttcccacc gggcccatcg ctggcctgcc 12120 

ctacatgaag cgcctgacag tggcccacca gctacaccac agcggcaagt acggtggcgc 12180 

gccctggggt atgttcttgg gtccacagga gctgcagcac attccaggtg cggcggagga 12240 

ggtggagcga ctggtcctgg aactggactg gtccaagcgg tagaagcttg gcgtaatcat 12300 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 12360 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 12420 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 12480 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcc aaagacaaaa gggcgacatt 12540 

caaccgattg agggagggaa ggtaaatatt gacggaaatt attcattaaa ggtgaattat 12600 

caccgtcacc gacttgagcc atttgggaat tagagccagc aaaatcacca gtagcaccat 12660 
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tac cartage 
taccgacaga 
eggtcatage 
acccaccacc 
caccaccct c 
gcccgatcta 
attttgtttt 
ctcataaata 
aattatatga 
caaatgtttg 
gaaegcagea 
ttgatgtgga 
gccgttgctg 
gcgaagaact 
acgat t ccga 
aggttgggcg 
caagaaggcg 
ggaagcggtc 
tgtcctgata 
cattttccac 
egtegggcat 



aaggccggaa 
atcaactttg 
ccccttatta 
ggaaccgcct 
agagccgcca 
gtaacataga 
etatcgegta 
aegtcatgea 
taatcat cgc 
aacgateggg 
agatatcgeg 
cgccgggccc 
tegtaatgat 
ccagcatgag 
agcccaacct 
tcgcttggtc 
atagaaggcg 
agcccattcg 
gcggtccgcc 
catgatattc 
gcgcgccttg 



acgtcaccaa 
ectttagegt 
gcgtttgcca 
ccctcagagc 
ccagaaccac 
tgacaccgcg 
ttaaatgtat 
ttacatgtta 
aagaceggea 
gatcatcegg 
gtgeatcteg 
gatcatattg 
atcggcacct 
atccccgcgc 
ttcatagaag 
ggtcatttcg 
atgcgctgcg 
ccgccaagct 
acacccagcc 
ggcaagcagg 
agectggega 



tgaaaccaic 
cagactgt ag 
tcttttcata 
cgccaccctc 
caccagagcc 
cgegataatt 
aattgcggga 
attattacat 
acaggattca 
gtctgtggcg 
gtcttgcctg 
tegctcagga 
tcgaccgcct 
tggaggatca 
gcggcggtgg 
aaccccagag 
aategggage 
cttcagcaat 
ggccacagtc 
catcgccatg 
acagttegge 



catagcagca 

cgcgttt rca 

a tcaaaa nca 

agaaccgcca 

gccgccagca 

tatcctagtt 

ctctaatcat 

gettaaegta 

atcttaagaa 

ggaactccac 

ggcagtcgcc 

tcgtggcgtt 

gttcegcaga 

tccagccggc 

aatcgaaatc 

tcccgctcag 

ggcgataccg 

ateaegggta 

gatgaatcca 

ggtcacgacg 

tggegegage 



ccgtaatcac 

t eggcat t t t 

ccagaaccac 

ccctcagagc 

t tgacaggag 

tgegegctat 

aaaaacccat 

attcaacaga 

actttattgc 

gaaaatatcc 

gccgacgccg 

gtgcttgtcg 

gatcccgtgg 

gtcccggaaa 

tcgtgatggc 

aagaactcgt 

taaagcacga 

gccaacgcta 

gaaaagegge 

agatcatege 

ccctgatgct 



12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
•13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
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cttcgtccag atcatcctga tcgacaagac cggcttccat ccgagtacgt qctcgctcga 139S0 

tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta iccagccgcc 14040 

gcattgcatc agccatgatg gatactttct cggcaggagc aaggtgagat gacaggagat I4I00 

ccrgccccgg cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 14160 

gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 14220 

gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 14280 

ctgacagccg gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 14340 

cgaatagcct ctccacccaa gcggccggsg aacctgcgtg caatccatct tgttcaatca 14400 

tgcgaaacga tccagatccg gtgcagatta tttggattga gagtgaatat gagactctaa 14460 

ttggataccg aggggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14520 

tagctgatag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14580 

ttagctcatt aaactccaga aacccgcggc tgagtggctc cttcaacgtt gcggttctgt 14640 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14700 

ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14760 

tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14820 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat gccaaccaca 14880 

gggttcccca gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg 14 940 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 15000 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 15060 

cagcaccggc ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15120 

gatcaggggt atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg 15180 
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aacgcgcgga ttctttatca ctgataagtt ggtggacata ttatgtttat cactgataaa 15240 

gtgtcaagca tgacaaactt gcagccgaat acagtgatcc gtgccgccct ccacctcttc 15300 

aacgaggtcg gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag 15360 

cagccggcgc tittactggca cttcaggaac aagcgggcgc .tgcucgacgc actggccgaa 15420 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15480 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15540 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 15600 

cgcttcctct gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc- 15660 

agctacttca ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag 15720 

cgcggcggca ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc 15780 

ttcgacgaag ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga 15840 

ttggcgaaaa ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga 15900 

tcaggaccgc tgccggagcg caacccactc actacagcag agccatgtag acaacatccc 15960 

ctcccccttt ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct 16020 

gccctagcgt ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt 16080 

ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16140 

ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16200 

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16260 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16320 

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 16380 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16440 
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tggcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt tccggratat 16500 

ccatccttm rcgcacgata tacacgattt tgccaaaggq ttcgtgtaga cttrccrtgg 16560 

tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggrgt 16620 

tccrtcttca ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgcrctgcga 16680 

ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag 16740 

ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg 16800 

attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc 16860 

cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc 16920 

aatggcgacc tgggccgcct gggcggcccg ctgaaactct ggctcaccga cgacccgcgc 16980 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 17040 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 17100 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17160 

caagaagagc gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga 17220 

gcgcctttgc gacgctca 17238 



<210> 40 

<211> 18449 

<212> DNA 

<213> Artificial 



<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 
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<220> 

<221> misc_f eaiure 

<222> (3679) . . ( 3679) 

<223> n is a, c, g, or t 

<220> 

<22 1 > rnisc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, q, or t 

<400> 40 

gatctrtcga'cacigaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 



BASF NAE 878/03 140/357 

cacgcctacg gaagacggag aagccacctt cagtggactc gactaccart uaanctait 960 

ictamga: cgagacctaa tacagcccct acaacgacca rcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 10S0 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gccrggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac • cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 
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accargcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

ctctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatcta 22S0 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaaqatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgcr tgacarrggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg • 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 
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cagctcatct 
gcgaagacaa 
icaacgctcg 
atatttgtct 
gaaaattccg 
ctcaagccta 
gatggtatac 
gccggccgaa 
cttgtttaga 
ctccacatct 
agaaagtgaa 
ctttcttttt 
ggagtttgaa 
acaagaatag 
ttctttgtct 
aggaaaaggg 
aatttggcaa 
taaatctcga 
tgatggaaga 
aaaaagaaat 
gctgcagtca 



gcaatgcatt 
ga at: agct t a 
tcaagagacc 
ctaattgtac 
tcaccagcnc 
caggacacac 
aatagtaacc 
acttttttac 
ggtaatcctt 
ccactcgacc 
aagaaattcg 
ccatcgatat 
attttgtcct 
ttaccaattt 
ttacattttc 
agcggaaagg 
acagcagcct 
tgacccgacc 
ttttttcccc 
tgtacaaact 
atcagcgtca 



aatgcattga 
gcagagctat 
tacgagactg 
tttgacatgc 
ctgggttcgc 
attcatcgta 
atgcatggtt 
aactctccta 
ctttctagct 
tgcaggcatg 
gaagcatggc 
gatatatgca 
ccaggagcaa 
gctttggtct 
tttgtccttt 
taagaaatgc 
gtgcaacgct 
aaaaagagcg 
cttattctaa 
ttcttttctt 
aggcccgccg 



ctgcaaccta 
tttcattttc 
aggaatccgc 
tcctcttctt 
aaagataatt 
ggtat aaacc 
gcctagtgaa 
tgagtcgttt 
agaagtcctc 
caaagcttga 
acattcttct 
tatgatagat 
aaaaaagttt 
tacgtgctgc 
atctttcctc 
ttgcgtattt 
ctcgagatga 
aacaaagaaa 
atgttgacat 
ctctttttat 
cgttgaacta 



gtaacgcctr 

gggagacgag 

tcttggctcc 

tactctgata 

gcatgtttct 

tcgaaatcan 

tgctccgtaa 

acccagaatg 

gtgtactgtg 

gattaaaata 

ttttataaat 

atacaagcaa 

ttttttatac 

aagtttatat 

atttagtctt 

tactaattcg 

cagtatcttt 

taatcttgtg 

agcgtgtatg 

tttatctcta 

gcccgcgaca 



ncaggctccc 

atcaa ocaca 

acgcgactat 

gcttgactat 

tcctt gaact 

ttcct actaa 

cacccaatac 

cacaggtaca 

taagcgccca 

gataa ggaaa 

acatgcctga 

tcttcttcaa 

atgtttgtac 

cgttttcaat 

tgggagaatt 

gcaaacatcc 

gattacactc 

cattcgaata 

ttatataaac 

tgctgtcgaa 

tcacgcggcc 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 
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caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 

gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg cgcgatgagc acagcgccga 4 S00 

tgtagcact.c cagcagcttg accgggctat cgcagagcgt cgtgcccggc gcaaacggga 4860 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgtgrcag gcattgccat 4920 

cttcgccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag rgccatgggg 4 9S0 

tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgta'cacga 5280 

tggcctggt-g cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat 5400 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcggta gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 

cgcccgtagc ctaggtgatc aaagactgtt gggcttgtct ctgaagcttg taggaaaggc 5640 

agacgctatc atggtgagag ctaagaaggg cattgacaag ttgccggcaa actgtcaagg 5700 

cggtgtacga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatat cctacaagag ctcatctaaa aggaagcgaa cgtgccaaga ttgctctgtt 5820 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 
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aaagagacaa cacatacaat ataaaatcat tctctrtaaa tgttaatuua gtagagccta 6000 

aagcctccat ttttrttgta cgcataaaca atgaantcac cccgcrtctg grriiiaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttttt ggcttgttgt 6120 

ggagtcacag qczzgtcztc agattgatag aggttgtata cactcaacag agcaacctrg 6180 

gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcttgagt 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 

cagcacatac aaaaaaaaag aaatttaaga tgagtaggac ttccattctc tcaaaaattt 6600 ■ 

tattcaatcc ataaatgaat tatttttgga caaaaaagaa agattatgcc tgattttctc 6660 

tatttttttt ttttttacaa ctccaccaat actttctagc ccagcttggc gtaatcatgg 6720 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 6780 

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 6840 

ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 6900 

ggccaacgcg cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca 6960 

accgattgag ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca 7020 

ccgtcaccga cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta 7080 

ccattagcaa ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta 7140 

gcgacagaat caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg 7200 
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ctcatagccc ccttattagc gtttgccatc ttttcataat caaaatcacc cgaaccagag 7260 

ccaccaccgg aacccccicc ctcagagccg ccaccctcag aaccgccacc ctcagaccca 7320 

ccaccctcag agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc 7380 

ccgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatai 7440 

tttgttrtct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccarct 7500 

cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa 7560 

ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca 7620 

aatgtttgaa cgatcgggga tcatccgggt. ctgtggcggg aactccacga aaatatccga 7680 

acgcagcaag atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt 7740 

gatgtggacg ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc 7800 

cgttgctgtc gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaagaactcc agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac 7920 

gattccgaag cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag 7980 

gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca 8040 

agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 8100 

aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 8160 

tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca 8220 

ttttccacca tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg 8280 

tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 8340 

tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 8400 

cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 8460 
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attgcaicag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 8520 

tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtzcgagc 85SC 

acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 8640 

agutcatuca gcgcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 8700 

gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 8760 

aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 8820 

cgaaacgatc cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt 8880 

ggataccgag gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta 8940 

gctgatagtg accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt 9000 

agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 

ccgacagcgc gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca 9420 

gaatgccata gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 
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ccagctcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcacca 9780 

gccggcgctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 9840 

catgctggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgcrcati: 9900 

tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg 10020 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 

cgacgaagcc ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 10440 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 10680 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 10800 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 10860 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 
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tatccaacgg cgtcagccgg gcaggatagg tcaagtaggc ccacccgcga gcgggtgttc 11040 

cttcttcact gtcccttatt cgcacctggc ggtgctcaac gggaa^ccrg ctctgcgacg 11100 

ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 1 1 i 60 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ■ ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 11460 

cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta 11520 

gccgctaaaa cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca 11580 

agaagagcga cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc 11640 

gcctttgcga cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc 11700 

cctgcaaacg cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt 11760 

tgtggatacc tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact 11820 

tgaggggccg actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg 11880 

gcgacgtgga gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc 11940 

ccacagatga tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc 12000 

gcgactactg acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga 12060 

tgaggggcgc acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc 12120 

aagggtttcc gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca 12180 

atatttataa accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg 12240 
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aagggcggtg cccccccttc rcgaaccctc ccggcccgct aacgcgggcc tcccatcccc 12300 

ccagcggctg cccccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt 12360 

ccttgccatt gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc 12420 

cggaagcatt gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag 12480 

tgagggcggc ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga 12540 

cttcatggcg gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc 12600 

cgtgctcgtg ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt 12660 

ataccgaggt atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat 12720 

ttaaaaagct accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat 12780 

attgacaata ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga 12840 

tttcaggggg caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca 12900 

taaaaacttg catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt 12960 

ctatcataat tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc 13020 

gatgactttg tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg 13080 

tgccaggtgc tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct 13140 

gattacgtgc agct t tccc.t tcaggcggga ttcatacagc ggccagccat ccgtcatcca 13200 

tatcaccacg tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg 13260 

ttcaccgaat acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca 13320 

gcgctggcgc gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat 13380 

gacgtcactg cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga 13440 

cgtaaaatcg tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca 13500 
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ttcatcgcca tatcaaicat tttctggtgc gtaccgggtt gagaaccgct gtaactgaac 13560 

tgcagttgcc argttttacg gcagtgagag cagagatagc cctgatgtcc ggcggtgctt 13620 

ttgccgt rac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag caccicaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac 13800 

aactttgaaa aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 

ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc 14160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag 14340 

acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 

aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga 14460 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagct attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt 14700 

atattttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 
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caggagcgca 
a a g z a z z z c g 
gagaaggacg 
gacaccaagg 
ggggcaatcc 
gaactgatcg 
atgcgtgcgc 
aagatcgagc 
gtggagcgtt 
gacacgcgag 
caggtcagcg 
atgcagcttt 
gacacgcccc 
caaaacaagg 
ctgcgggccg 
cctatcggcg 
atcaatggcc 
atgggcttca 
cgcgtcctgg 
gtcgtgctgt 
tcgccgacgg 



ccgacttctt 
gcaaggggtc 
gccagacggn 
caccaggcgg 
cgcaaggagg 
acgcggggtt 
cccgcgaaac 
gcgacagcgt 
cgcgtcgtct 
gaactatgac 
aggccaagca 
ccttgttcga 
gctctgccct 
tcattttcca 
acgatgacga 
agccgatcac 
ggtattacac 
cgtccgaccg 
accgtggcaa 
ttgctggcga 
cccgacggat 



ccgcatcaag 
gctggtattc 
ctacgggacc 
gtcaaatcag 
gtgaatgaat 
ttccgccgag 
cttccagtcc 
gcaactggct 
cgaacaggag 
gaccaagaag 
ggccgcgttg 
tattgcgccg 
gttcaccacg 
cgtcaacaag 
actggtgtgg 
cttcacgttc 
gaaggccgag 
cgttgggcac 
gaaaacgtcc 
ccactacacg 
gttcgactat 



tgttttggct 

gtgcagggca 

gacttcattg 

gaataagggc 

cggacgtttg 

gatgccgaaa 

gtcggctcga 

ccccctgccc 

gcggcaggtt 

cgaaaaaccg 

ctgaaacaca 

tggccggaca 

cgcaacaaga 

gacgtgaaga 

cagcaggtgt 

tacgagcttt 

gaatgcctgt 

ctggaatcgg 

cgttgccagg 

aaattcatat 

ttcagctcgc 



ctcagcccca 

agattcggaa 

ccgataaggt 

acattgcccc 

accggaaggc 

ccatcgcaag 

tggtccagca 

tgcccgcgcc 

tggcgaagtc 

ccggcgagga 

cgaagcagca 

cgatgcgagc 

aaatcccgcg 

tcacctacac 

tggagtacgc 

gccaggacct 

cgcgcctaca 

tgtcgctgct 

tcctgatcga 

gggagaagta 

accgggagcc 



ggcccacggc 

taccaagtac 

ggattanctg 

ggcgtgagrc 

atacaggcaa 

ccgcaccgtc 

agctacggcc 

atcggccgcc 

gatgaccatc 

cctggcaaaa 

gatcaaggaa 

gatgccaaac 

cgaggcgctg 

cggcgtcgag 

gaagcgcacc 

gggctggtcg 

ggcgacggcg 

gcaccgcttc 

cgaggaaatc 

ccgcaagctg 

gtacccgctc 



14820 

14830 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 
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aagctcgaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtgccgc I60S0 

gagcaccicg gcgaagcctg cgaagagttc cgaggcagcc gcctcgtgga acacgcctgg 1614 0 

gtcaatgatg acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg 16200 

ggttcagcag ccagcgcttt actggcattt caggaacaag cgggcactigc icgacgcact 16260 

tgcttcgctc agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag 16320 

gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 

aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggccccaa ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 

tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 

cgctgttggt gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg 16980 

cgggggcggt ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggctcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc 17280 
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cggggatgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcgctgag 17340 

caarggatac gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttcctcagcg gctttstcca gcgatttcct attatgtcgg catagttctc aagatcgaca 17460 

gccrgicacc gtraagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17580 

ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 

aaatcaaaag aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

ccactacgtg aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 

aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 18240 

gcgagaaagg aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg 18300 

ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 

tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 

ggccagtgaa ttcgagctcg gtacccggg 18449 

<210> 41 



BASF NAE 878/03 



154/357 



<211> 18449 

<2I2> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3770) . . (3770) 

<223> n is' a, c, g, or t 

<400> 41 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 
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aatattcica agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gacgttrtcc aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagcx: ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cactcccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 
tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca ' 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt: gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc att.tgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 
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tggcagtcga cccatccggt gctctgcact ccacctgctg aggtccctca gtccctggta 1860 

ggcagctt-tg ccccgrctgt ccccccggtg tgtcggcggg cttgacaagg tcgttgcgtc 1S20 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcittcctc 1980 

tttcttttcc catcttcagc atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggaggqc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 2460 • 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagc.tg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga ■ t ttcggct cc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 
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gggacrgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatcgctgt 3120 

gtagaagtac tcgccgacag tggaaaccga cgccccagca ctcgtcccag ggcaaaggaa 3180 

tagaguagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaarci ggaiataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagrg 3360 

atttaatagc tccangtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caaagcttga gattaaaata gataaggaaa 4020 

agaaagtgaa aagaaattcg gaagcatggc acattcttct ttttataaat acatgcctga 4080 

ctttcttttt ccatcgatat gatatatgca tatgatagat atacaagcaa tcttcttcaa 4140 

ggagtttgaa attttgtcct ccaggagcaa aaaaaagttt ttttttatac atgtttgtac 4200 

acaagaatag ttaccaattt gctttggtct tacgtgctgc aagtttatat cgttttcaat 4260 

ttctttgtct ttacattttc tttgtccttt atctttcctc atttagtctt tgggagaatt 4320 
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aggaaaaggg agcggaaagg taagaaatgc ttgcgtattt tactaattcc gcaaacetcc 43S0 

aatttggcaa acagcagcct gtgcaacgct ctcgagatga cagtatcttt garracactc 4440 

taaatctcga tgacccgacc aaaaagagcg aacaaagaaa taatcttgtg cart ccaaia 4500 

tgatggaaga ttttttcccc cttattctaa atgttgacat agcgtgtatg ttarataaac 4560 

aaaaagaaat tgtacaaact ttcttttctt ctctttttat tttatctcta tgctgtcgaa 4620 

gctgcagtca atcagcgtca aggcccgccg cgttgaacta gcccgcgaca tcacgcggcc 4 6SO 

caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 

gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg ggcgatgagc acagcgccga 4 800 

tgtagcactc cagcagcttg accgggctat cgcagagcgt cgtgcccggc gcaaacggga 4860 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgtgtcag gcattgccat 4 920 

cttcgccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag tgccatgggg 4 980 

tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgtacacga 5280 

tggcctggtg cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat 5400 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcgggc gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 
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cgcccgzaqc ctacgtgatc aaagactgtt gggcttgtct ctgaagctrg tagcaaacgc 5640 

agacgctatc atcgrgagag ctaagaaggg cattgacaag ttgccggcaa acictcaagg 5700 

cggtgracga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatai cctacaagag Ctcatctaaa aggaagcgaa cgtgccaaga ttgcictgtt 5S20 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 

aaagagacaa cacatacaat ataaaatcat tgtctttaaa tgttaattta gtagagtgta 6000 

aagcctgcat tttttttgta cgcataaaca atgaattcac cccgcttctg gtttttaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttttt ggcttgttgt 6120 

ggagtcacag gcttgtcttc agattgatag aggttgtata cactcaacag agcaatcttg 6180 

gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcttgagt • 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc ' 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 

cagcacatac aaaaaaaaag aaatttaaga tgagtaggac ttccattctc tcaaaaattt 6600 

tattcaatcc ataaatgaat tatttttgga caaaaaagaa agattatgcc tgattttctc 6660 

tatttttttt ttttttacaa ctccaccaat actttctagc ccagcttggc gtaatcatgg 6720 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 6780 

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 6840 
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tugcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagcrgca ttaatgaatc 6900 

ggccaacgcg cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca 6960 

accgattgag ggagggaagg taaatattga cggaaattat tcattaaagg tgaatrarca 7020 

ccgtcaccga cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta 7030 

ccattagcaa ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta 7140 

gcgacagaat caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg 7200 

gtcatagccc ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag 7260 

ccaccaccgg aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca 7320 

ccaccctcag agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc 7380 

ccgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat 7440 

tttgttttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct 7500 

cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa 7560 

ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca 7620 

aatgtttgaa cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga 7680 

acgcagcaag atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt 7740 

gatgtggacg ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc 7800 

cgttgctgtc gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaagaactcc agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac 7920 

gattccgaag cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag 7980 

gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca 8040 

agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 8100 
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aagcggtcag cccattcgcc gccaagctct tcagcaatat cacccctagc caacgctatg SI60 

tcctgatagc cgtccgccac acccagccgg ccacagtcga tgaatccaca aaagcqgcca 3220 

ttttccacca tgarattcgg caagcaggca tcgccatggg tcacgacgag atcat cgccg 8280 

tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgat cctct 8340 

tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgct cqatg 8400 

cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 8460 

attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc S520 

tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc- 8580 

acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgt cctgc 8640 

agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccct'gcgct 8700 

gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 8760 

aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 8820 

cgaaacgatc cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt 8880 

ggataccgag gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta 8940 

gctgatagtg accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt 9000 

agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 
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ccgacagccc gcccagcaca ggrgcgcagg caaattqcsc caacccatac agcgccagca 9420 

gaatgccata gtcggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat grtgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 

cgaggtcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca 9780 

gccggcgctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 9840 

catgctggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt 9900 

tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg 10020 - 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 

cgacgaagcc • ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 10440 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 
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tcagcaaaag gccaccaaaa ggccagcaac cgtaaaaagc ccgcgttgct ggcctt-ittc 106S0 

cataggctcc gcccccctga cgagcatcac aaaaatcgac cctcaagica gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctcccr cgtgcgcict 10800 

ccrgtrccga ccctgccgct taccggatac ctgtccgcct .ttctcccttc gggaagcgtq 10860 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 

tatccaacgg cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc 11040 

cttcttcact gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg 11100 

ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 11160 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 114 60 

cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta 11520 

gccgctaaaa cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca 11580 

agaagagcga cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc 11640 

gcctttgcga cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc 11700 

cctgcaaacg cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt 11760 

tgtggatacc tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact 11820 

tgaggggccg actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg 11880 
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gcgacgtgga cctggccagc ctcgcaaatc ggcgaaaacg cctcattt-a cgcgagrttc 11940 

ccacagatga tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc 12000 

gcgactactg acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga 12060 

ugaggggcgc acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc 12120 

aagggtttcc gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca 12180 

atatttataa accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg 12240 

aaggggggtg cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc 12300 

ccaggggctg cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt 12360 

ccttgccatt gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc 12420 

cggaagcatt gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag 12480 

tgagggcggc ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga 12540 

cttcatggcg gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc 12600 

cgtgctcgtg ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt 12660 

ataccgaggt atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat 12720 

ttaaaaagct accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat 12780 

attgacaata ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga 12840 

tttcaggggg caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca 12900 

taaaaacttg catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt 12960 

ctatcataat tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc 13020 

gatgactttg tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg 13080 

tgccaggtgc tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct 13140 
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gattacgtgc agctttcccr tcaggcggga ttcatacacc ggccagccat ccctcatcca 13200 

tatcaccacg rcaaagggtg acagcaggct cataagacgc cccagcgicg ccatagtgcg 13260 

ttcaccgaat acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca 13320 

gcgctggcgc gatttagccc cgacatagcc ccactgttcg . tccatttccg cgcagacgat 13380 

gacgrcactg cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga 13440 

cgtaaaatcg tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca 13500 

ttcatggcca tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac 13560 

tgcagttgcc atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt 13620 

ttgccgttac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac • 13800 

aactttgaaa aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 

ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc 14160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag 14340 

acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 
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aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat tttttzaaaga 14460 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagcr attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt 14700 

atattttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 

caggagcgca ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc 14820 

aagtatttgg gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac 14880 

gagaaggacg gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg 14940 

gacaccaagg caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc 15000 

ggggcaatcc cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa 15060 

gaactgatcg acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc 15120 

atgcgtgcgc cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc 15180 

aagatcgagc gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc 15240 

gtggagcgtt cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc 15300 

gacacgcgag gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa 15360 

caggtcagcg aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa 15420 

atgcagcttt ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac 15480 

gacacggccc gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgagcjcgctg 15540 

caaaacaagg tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag 15600 

ctgcgggccg acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc 15660 
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cctatcggcc agccgatcac cttcacgttc tacgagcttt gccaggacct gcgctggtcg 15720 

atcaatggcc ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcc 15780 

atgggcttca cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc 15840 

cgcgtcctgg accgrggcaa gaaaacgtcc cgttgccagg tcctgatcga cgacgaaatc 15900 

gtcgtgctgt ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg 15960 

tcgccgacgg cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgcrc 16020 

aagctggaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc 16080 

gagcaggtcg gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg 16140 

gtcaatgatg acctggtgca ttgcaaacgc tagggccttg tggggtca'gt tccggctggg 16200 

ggttcagcag ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact 16260 

tgcttcgctc agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag 16320 

gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 

aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggc.cccaa ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 

tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 
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cgctgttcgt gttgacacca aacgcagcgc tagatcctgt cggcctcgca gcgggcctgg I69S0 

cgggggccgt: trccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggcrcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc 17280 

cggggacgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag 17340 

caatggatag gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttcctcagcg gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca 17460 

gcctgtcacg gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17580 

ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ctggctggtg " gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 

aaatcaaaag aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

ccactacgtg aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 
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169/357 



aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc cgcgaacgic 18240 

gcgagaaagg aagggaagaa agcgaaagga gcgggcgcca ttcagcctgc gcaactgttg 18 2 00 

ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 

tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 

ggccagtgaa ttcgagctcg gtacccggg 1844 9 

<210> 42 

<211> 17593 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 42 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 
aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 



aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 
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ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg nggagctggc 240 

caccctcgca aatcgccgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg ' gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgacrccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 
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agattcaggt tatgcccctc aattcgcioc gtatatcgct tgctgattac giccagcttt 1500 

cccttcaggc gggartcaca cagcggccag ccatccgtca tccatatcac cacgicaaac 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattra 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgceatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agcrgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat eacccataat tgtggtttca • 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat ' 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 
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attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaa.gg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agrcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 
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acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4 OS 0 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggarcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 44 40 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc- atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat . 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 
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artgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggrgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtutccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggtt.ccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 
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cacccaaatc aagttrcttg gggtcgaggt gccgtaaagc actaaetcgg aaccctaaac 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaqa aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgaitaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct* cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 
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atttaattct atrtgtgtrr gatcgagacc taatacagcc cctacaacga ccatcaaagt 7300 

cgtatagcua ccagtgagga agtggactca aatcgacttc agcaacatct cctcgataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7930 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctct'c 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca . agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 
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aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttzt 9060 

caccttcgat gtagcagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 94 80 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 
gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc • 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 
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cttncaggct ccggcgaaga gaagaatagc rtagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctctitggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tctrccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ttttcgagtt 10800 

tttttttttt ttctttgtga aggatttatt gttattggta tccatttttt attggaagac 10860 

aagataagtt aatattgatt ttgcttaaag attaaaagga aatcagaaaa cgacaataaa 10920 

aaatgtaacg gacaaactat ggtgtcgatt ataagtctaa atccttaaaa aatgacaacg 10980 

agttgctttc ctctgaaaac aattcttttg tctttgcaag aaaggtttct tttttgtttg 11040 

cttgcattac ttaaacatca aatcaaatga aaggaataaa gcagatttga gggcgaataa 11100 

ggattttctg gtcaacaaga tgtgagtgac acctaaggaa ctaaatgcca ttcatttgtt 11160 

ttaaaacgac atcaaagatt gatgatcaac aggattgaga gagagaaaaa gaactcgtgt 11220 

catttatttc tgttgactga aattttatat ttagaaaaaa tgtcaaatct atagctttag 11280 

ctatattaca taacatttga aataataata ataaaaaaag acacattaga gacacttttc 11340 

aaactctaaa taactgtcta taaacacaaa gaaaacaaag acctctataa caacttatta 11400 

gatttttctc gtacttttgt ctaaagatga tgtattcttg ttatcccaca cttctttcat 11460 

ttgttcttga tgctactaaa tatacaaaat ttcttttttg caagagatat tattccaaaa 11520 
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attttcaaaa agaaattttt ttcacaatac cagttgatcg tctaacccaa agagcttctt 11580 

tgttattttc cacitccgct ttgcggtgat gcatattcaa agtaatatat cgaataaaca 11640 

acgtgtttaa gcatgaaaga aaggaaacaa aggccgcttt gaacaaatgc ataaratttc 11700 

agacaaaaat gatctaaagc aagcagtaaa tcaaacaaga aacattgctg attcgcgtta 11760 

gaaaacgata aaagtctaat aagccactaa gtatacttca atgaactttt tgtatgctta 11820 

tggtccaatc agaccaataa tttgtgacca ttcctgaggt ggctttggtg atgcggaaac 11880 

agaaaaaaat tttctcacca atcgatttaa aaaacaattt ctgctttgaa ccaaaacttt 11940 

ttttttctct ttaatcatta actttatcaa gtatgtacct accctcaaag tcctcactca 12000 

agcacaatta tgctaacatt gttccacctt ctctttagaa atgttgtgga tttggaatgc 12060 

cctgatcgtt ttcgttaccg tgattggcat ggaagtgatt gctgcactgg cacacaaata 12120 

catcatgcac ggctggggtt ggggatggca tctttcacat catgaaccgc gtaaaggtgc 12180 

gtttgaagtt aacgatcttt atgccgtggt ttttgctgca ttatcgatcc tgctgattta 12240 

tctgggcagt acaggaatgt ggccgctcca gtggattggc gcaggtatga cggcgtatgg 12300 

attactctat tttatggtgc acgacgggct ggtgcatcaa cgttggccat tccgctatat 12360 

tccacgcaag ggctacctca aacggttgta tatggcgcac cgtatgcatc acgccgtcag 12420 

gggcaaagaa ggttgtgttt cttttggctt cctctatgcg ccgcccctgt caaaacttca 12480 

ggcgacgctc cgggaaagac atggcgctag agcgggcgct gccagagatg cgcagggcgg 12540 

ggaggatgag cccgcatccg ggaagtaagg gcctgaccag aggcggccag cagcagcgtt 12600 

aatttttcgg gcgtggtcgt tgactgccgc tgatcccaaa gcttggcgta atcatggtca 12660 

tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 12720 

agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 12780 
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cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 12340 

caacccgccc ggagaggcgg 1 1 tgcgt att gggccaaaga caaaagggcg acattcaacc 12900 

gattgaggga gggaaggtaa atattgacgg aaattattca ttaaaggtga attatcaccg 12960 

tcaccgactt gagccatttg ggaattagag ccagcaaaat caccagtagc accattacca 13020 

ttagcaaggc cggaaacgtc accaatgaaa ccatcgatag cagcaccgta atcagtagcg 13080 

acagaatcaa gtttgccttt agcgtcagac tgtagcgcgt tttcatcggc attttcggtc 13140 

atagccccct tattagcgtt tgccatcttt tcataatcaa aatcaccgga accagagcca 13200 

ccaccggaac cgcctccctc agagccgcca ccctcagaac cgccaccctc agagccacca 13260 

ccctcagagc cgccaccaga accaccacca gagccgccgc cagcattgac aggaggcccg 13320 

atctagtaac atagatgaca ccgcgcgcga taatttatcc tagtttgcgc gctatatttt 13380 

gttttctatc gcgtattaaa tgtataattg cgggactcta atcataaaaa cccatctcat 13440 

aaataacgtc atgcattaca tgttaattat tacatgctta acgtaattca acagaaatta 13500 

tatgataatc atcgcaagac cggcaacagg attcaatctt aagaaacttt attgccaaat 13560 

gtttgaacga tcggggatca tccgggtctg tggcgggaac tccacgaaaa tatccgaacg 13620 

cagcaagata tcgcggtgca tctcggtctt gcctgggcag tcgccgccga cgccgttgat 13680 

gtggacgccg ggcccgatca tattgtcgct caggatcgtg gcgttgtgct tgtcggccgt 13740 

tgctgtcgta atgatatcgg caccttcgac cgcctgttcc gcagagatcc cgtgggcgaa 13800 

gaactccagc atgagatccc cgcgctggag gatcatccag ccggcgtccc ggaaaacgat 13860 

tccgaagccc aacctttcat agaaggcggc ggtggaatcg aaatctcgtg atggcaggtt 13920 

gggcgtcgct tggtcggtca tttcgaaccc cagagtcccg ctcagaagaa ctcgtcaaga 13980 

aggcgataga aggcgatgcg ctgcgaatcg ggagcggcga taccgtaaag cacgaggaag 14040 
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ccgtcagccc 
tgat agcggt: 
tccaccatga 
ggcatgcgcg 
tccagatcat 
tgtttcgctt 
gcatcagcca 
cccggcactt 
gctgcgcaag 
tcattcaggg 
agccggaaca 
agcctctcca 
aacgatccag 
taccgagggg 
gatagtgacc 
tcattaaact 
ccaaacgtaa 
tctccgctca 
ggatatattg 
aaagggcgtg 
ccccagatct 



attcgccgcc 
ccgccacacc 
tattcggcaa 
ccttgagcct 
cctgatcgac 
ggtggtcgaa 
tgatggatac 
cgcccaatag 
gaacgcccgt 
caccggacag 
cggcggcatc 
cccaagcggc 
atccggtgca 
aatttatgga 
ttaggcgact 
ccagaaaccc 
aacggcttgt 
tgatcagatt 
gcgggtaaac 
aaaaggttta 
ggcgccggcc 



aagctcttca 
cagccggcca 
gcaggcatcg 
ggcgaacagt 
aagaccggct 
tgggcaggta 
tttctcggca 
cagccagtcc 
cgtggccagc 
gtcggtcttg 
agagcagccg 
cggagaacct 
gattat ttgg 
acgtcagtgg 
tttgaacgcg 
gcggctgagt 
cccgcgtcat 
gtcgtttccc 
ctaagagaaa 
tccgttcgtc 
agcgagacga 



gcaatatcac 
cagtcgatga 
ccatgggtca 
tcggctggcg 
tccatccgag 
gccggatcaa 
ggagcaaggt 
cttcccgctt 
cacgatagcc 
acaaaaagaa 
att-gtctgtt 
gcgtgcaatc 
attgagagtg 
agcatttttg 
caataatggt 
ggctccttca 
cggcgggggt 
gccttcagtt 
agagcgttta 
catttgtatg 
gcaagattgg 



gggtagccaa 

at ccagaaaa 

cgacgagatc 

cgagcccctg 

tacgtgctcg 

gcgtatgcag 

gagatgacag 

cagtgacaac 

gcgctgcctc 

ccgggcgccc 

gtgcccagtc 

catcttgttc 

aatatgagac 

acaagaaata 

ttctgacgta 

acgttgcggt 

cataacgtga 

taaactatca 

ttagaataat 

tgcatgccaa 

ccgccgcccg 



cgccatgtcc 

gcggccattt 

atcgccgrcg 

atgctcttcg 

ctcgatgcga 

ccgccgcatt 

gagatcctgc 

gtcgagcaca 

gtcctgcagt 

ctgcg'ctgac 

atagccgaat 

aatcatgcga 

tctaattgga 

tttgctagct 

tgtgcttagc 

tctgtcagtt 

ctcccttaat 

gtgtttgaca 

cggatattta 

ccacagggtt 

aaacgatccg 



14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 
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acagcgcgcc cagcacaggt gcgcaggcaa attgcaccaa cgcatacagc gccagcagaa 15360 

tgccatagtg ggcggtgacg tcgttcgagt gaaccagatc gcgcaggagg cccggcagca 15420 

ccggcataat caggccgatg ccgacagcgt cgagcgcgac agtgctcaga attacgatca 15480 

ggggtatgtt gggtttcacg tctggcctcc ggaccagcct ccgctggtcc gattgaacgc 15540 

gcggattctt tatcactgat aagttggtgg acatattatg tttatcagtg ataaagtgtc 15600 

aagcatgaca aagttgcagc cgaatacagt gatccgtgcc gccctggacc tgttgaacga 15660 

ggtcggcgta gacggtctga cgacacgcaa actggcggaa cggttggggg ttcagcagcc 15720 

ggcgctttac tggcacttca ggaacaagcg ggcgctgctc gacgcactgg ccgaagccat 15780 

gctggcggag aatcatacgc attcggtgcc gagagccgac gacgactggc gctcatttct 15840 

gatcgggaat gcccgcagct tcaggcaggc gctgctcgcc taccgcgatg gcgcgcgcat 15900 

ccatgccggc acgcgaccgg gcgcaccgca gatggaaacg gccgacgcgc agcttcgctt 15960 

cctctgcgag gcgggttttt cggccgggga cgccgtcaat gcgctgatga caatcagcta 16020 

cttcactgtt ggggccgtgc ttgaggagca ggccggcgac agcgatgccg gcgagcgcgg 16080 

cggcaccgtt gaacaggctc cgctctcgcc gctgttgcgg gccgcgatag acgccttcga 16140 

cgaagecggt ccggacgcag cgttcgagca gggactcgcg gtgattgtcg atggattggc 16200 

gaaaaggagg ctcgttgtca ggaacgttga aggaccgaga aagggtgacg attgatcagg 16260 

accgctgccg gagcgcaacc cactcactac agcagagcca tgtagacaac atcccctccc 16320 

cctttccacc gcgtcagacg cccgtagcag cccgctacgg gctttttcat gccctgccct 16380 

agcgtccaag cctcacggcc gcgctcggcc tctctggcgg ccttctggcg ctcttccgct 16440 

tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 16500 

tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 16560 
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ccaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 16620 

aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag ctgccgaaac I66SC 

ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 16740 

gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 16800 

cttttccgct gcataaccct gcttcggggt cattatagcg attttttcgg tatatccatc 16860 

ctttttcgca cgatatacag gattttgcca aagggttcgt gtagactttc cttggtgtat 16920 

ccaacggcgt ' cagccgggca ggataggtga agtaggccca cccgcgagcg ggtgttcctt 16980 

cttcactgtc ccttattcgc acctggcggt gctcaacggg aatcctgctc tgcgaggctg 17040 

gccggctacc gccggcgtaa cagatgaggg caagcggatg gctgatgaaa ccaagccaac 17100 

caggaagggc agcccaccta tcaaggtgta ctgccttcca gacgaacgaa gagcgattga 17160 

ggaaaaggcg gcggcggccg gcatgagcct gtcggcctac ctgctggccg tcggccaggg 17220 

ctacaaaatc acgggcgtcg tggactatga gcacgtccgc gagctggccc gcatcaatgg 17280 

cgacctgggc cgcctgggcg gcctgctgaa actctggctc accgacgacc cgcgcacggc 17340 

gcggttcggt gatgccacga tcctcgccct gctggcgaag atcgaagaga agcaggacga 17400 

gcttggcaag gtcatgatgg gcgtggtccg cccgagggca gagccatgac ttttttagcc 17460 

gctaaaacgg ccggggggtg cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga 17520 

agagcgactt cgcggagctg gtgaagtaca tcaccgacga gcaaggcaag accgagcgcc 17580 

tttgcgacgc tea 17593 



<210> 43 

<211> 16954 

<212> DNA 

<213> Artificial 
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<220> 

<223> Plastic: 



<220> 

<221> ir:isc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 43 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 
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tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag raacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc t'gcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa ' cgacagcgac ttccgtccca gccgtgccag gtgctgcctc ' 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

• gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 
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tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttecccacca 19S0 

ccccgtcagt agctcaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aa.tcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaa'tgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 • 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2.820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 
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ggtcgctggt attcgtgcag ggcaagettc ggaataccaa guacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta i:ctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc" gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4 140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 
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cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gacgacctgg 4500 

tgcattccaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctt'tactggc atitcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gcucgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 
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tgatancgtc aacgttcect tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt rcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5320 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgcca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc • 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 
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gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tctrcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 
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cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccc-t aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 
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tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 
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gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgttgtgg atttggaatg ccctgatcgt tttcgttacc gtgattggca tggaagtgat 11460 

tgctgcactg gcacacaaat acatcatgca cggctggggt tggggatggc atctttcaca 11520 

tcatgaaccg cgtaaaggtg cgtttgaagt taacgatctt tatgccgtgg tttttgctgc 11580 

attatcgatc ctgctgattt atctgggcag tacaggaatg tggccgctcc agtggattgg 11640 

cgcaggtatg acggcgtatg gattactcta ttttatggtg cacgacgggc tggtgcatca 11700 

acgttggcca ttccgctata ttccacgcaa gggctacctc aaacggttgt atatggcgca 11760 

ccgtatgcat cacgccgtca ggggcaaaga aggttgtgtt tcttttggct tcctctatgc 11820 

gccgcccctg tcaaaacttc aggcgacgct ccgggaaaga catggcgcta gagcgggcgc 11880 

tgccagagat gcgcagggcg gggaggatga gcccgcatcc gggaagtaag ggcctgacca 11940 

gaggcggcca gcagcagcgt taatttttcg ggcgtggtcg ttgactgccg ctgatcccaa 12000 
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agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 12060 

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 12120 

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 12180 

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggccaaag 12240 

acaaaagggc gacattcaac cgattgaggg agggaaggta aatattgacg gaaattattc 12300 

attaaaggtg aattatcacc gtcaccgact tgagccattt gggaattaga gccagcaaaa 12360 

tcaccagtag caccattacc attagcaagg ccggaaacgt caccaatgaa accatcgata 12420 

gcagcaccgt aatcagtagc gacagaatca agtttgcctt tagcgtcaga ctgtagcgcg • 12480 

ttttcatcgg cattttcggt catagccccc ttattagcgt ttgccatctt ttcataatca 12540 

aaatcaccgg aaccagagcc accaccggaa ccgcctccct cagagccgcc accctcagaa 12600 

ccgccaccct cagagccacc accctcagag ccgccaccag aaccaccacc agagccgccg 12660 

ccagcattga caggaggccc gatctagtaa catagatgac accgcgcgcg ataatttatc 12720 

ctagtttgcg cgctatattt tgttttctat cgcgtattaa atgtataatt gcgggactct 12780 

aatcataaaa acccatctca taaataacgt catgcattac atgttaatta ttacatgctt 12840 

aacgtaattc aacagaaatt atatgataat catcgcaaga ccggcaacag gattcaatct 12900 

taagaaactt tattgccaaa tgtttgaacg atcggggatc atccgggtct gtggcgggaa 12960 

ctccacgaaa atatccgaac gcagcaagat atcgcggtgc atctcggtct tgcctgggca 13020 

gtcgccgccg acgccgttga tgtggacgcc gggcccgatc atattgtcgc tcaggatcgt 13080 

ggcgttgtgc ttgtcggccg ttgctgtcgt aatgatatcg gcaccttcga ccgcctgttc 13140 

cgcagagatc ccgtgggcga agaactccag catgagatcc ccgcgctgga ggatcatcca 13200 

gccggcgtcc cggaaaacga ttccgaagcc caacctttca tagaaggcgg cggtggaatc 13260 
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gaaatctcgt gatggcaggt tgggcgtcgc ttggtcggtc atttcgaacc ccagagtccc 13320 

gctcagaaga actcgtcaag aaggcgatag aaggcgatgc gctgcgaatc gggagcggcg 13380 

ataccgtaaa gcacgaggaa gcggtcagcc cattcgccgc caagctcttc agcaatatca 13440 

cgggtagcca acgctatgtc ctgatagcgg tccgccacac ccagccggcc acagtcgatg 13500 

aatccagaaa agcggccatt ttccaccatg atattcggca agcaggcatc gccatgggtc 13560 

acgacgagat catcgccgtc gggcatgcgc gccttgagcc tggcgaacag ttcggctggc 13620 

gcgagcccct gatgctcttc gtccagatca tcctgatcga caagaccggc ttccatccga 13680 

gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatc.a 13740 

agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 13800 

tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 13860 

tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 13920 

cgcgctgcct cgtcctgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 13980 

accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 14040 

tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 14100 

ccatcttgtt caatcatgcg aaacgatcca gatccggtgc agattatttg gattgagagt 14160 

gaatatgaga ctctaattgg ataccgaggg gaatttatgg aacgtcagtg gagcattttt 14220 

gacaagaaat atttgctagc tgatagtgac cttaggcgac ttttgaacgc gcaataatgg 14280 

tttctgacgt atgtgcttag ctcattaaac tccagaaacc cgcggctgag tggctccttc 14340 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 14400 

tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 14460 

ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 14520 
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attagaataa tcggatattt aaaagggcgt gaaaaggttt atccgttcgt ccatttgtat 14580 

gtgcatgcca accacagggt tccccagatc tggcgccggc cagcgagacg agcaagattg 14 640 

gccgccgccc gaaacgatcc gacagcgcgc ccagcacagg tgcgcaggca aattgcacca 14700 

acgcatacag cgccagcaga atgccatagt gggcggtgac gtcgttcgag tgaaccagat 14760 

cgcgcaggag gcccggcagc accggcataa tcaggccgat gccgacagcg tcgagcgcga 14820 

cagtgctcag aattacgatc aggggtatgt tgggtttcac gtctggcctc cggaccagcc 14880 

tccgctggtc cgattgaacg cgcggattct ttatcactga taagttggtg gacatattat 14940 

gtttatcagt gataaagtgt caagcatcac aaagttgcag ccgaatacag tgatccgtgc 15000 

cgccctggac ctgttgaacg aggtcggcgt agacggtctg acgacacgca aactggcgga 15060 

acggttgggg gttcagcagc cggcgcttta ctggcacttc aggaacaagc gggcgctgct 15120 

cgacgcactg gccgaagcca tgctggcgga gaatcatacg cattcggtgc cgagagccga 15180 

cgacgactgg cgctcatttc tgatcgggaa tgcccgcagc ttcaggcagg cgctgctcgc 15240 

ctaccgcgat ggcgcgcgca tccatgccgg cacgcgaccg ggcgcaccgc agatggaaac 15300 

ggccgacgcg cagcttcgct tcctctgcga ggcgggtttt tcggccgggg acgccgtcaa 15360 

tgcgctgatg acaatcagct acttcactgt tggggccgtg cttgaggagc aggccggcga 15420 

cagcgatgcc ggcgagcgcg gcggcaccgt tgaacaggct ccgctctcgc cgctgttgcg 15480 

ggccgcgata gacgccttcg acgaagccgg tccggacgca gcgttcgagc agggactcgc 15540 

ggtgattgtc gatggattgg cgaaaaggag gctcgttgtc aggaacgttg aaggaccgag 15600 

aaagggtgac gattgatcag gaccgctgcc ggagcgcaac ccactcacta cagcagagcc 15660 

atgtagacaa catcccctcc ccctttccac cgcgtcagac gcccgtagca gcccgctacg 15720 

ggctttttca tgccctgccc tagcgtccaa gcctcacggc cgcgctcggc ctctctggcg 15780 
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gccttctggc gctcttccgc trcctcgctc actgactcgc tgcgctcggt cgttcggctg 15840 

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 15900 

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 15960 

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 16020 

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 16080 

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 16140 

ctcccttcgg gaagcgtggc gcttttccgc tgcataaccc tgcttcgggg tcattatagc 16200 

gattttttcg gtatatccat cctttttcgc acgatataca ggattttgcc aaagggttcg 16260 

tgtagacttt ccttggtgta tccaacggcg tcagccgggc aggataggtg aagtaggccc 16320 

acccgcgagc gggtgttcct tcttcactgt cccttattcg cacctggcgg tgctcaacgg 16380 

gaatcctgct ctgcgaggct ggccggctac cgccggcgta acagatgagg gcaagcggat 16440 

ggctgatgaa accaagccaa ccaggaaggg cagcccacct atcaaggtgt actgccttcc 16500 

agacgaacga agagcgattg aggaaaaggc ggcggcggcc ggcatgagcc tgtcggccta 16560 

cctgctggcc gtcggccagg gctacaaaat cacgggcgtc gtggactatg agcacgtccg 16620 

cgagctggcc cgcatcaatg gcgacctggg ccgcctgggc ggcctgctga aactctggct 16680 

caccgacgac ccgcgcacgg cgcggttcgg tgatgccacg atcctcgccc tgctggcgaa 16740 

gatcgaagag aagcaggacg agcttggcaa ggtcatgatg ggcgtggtcc gcccgagggc 16800 

agagccatga cttttttagc cgctaaaacg gccggggggt gcgcgtgatt gccaagcacg 16860 

tccccatgcg ctccatcaag aagagcgact tcgcggagct ggtgaagtac atcaccgacg 16920 

agcaaggcaa gaccgagcgc ctttgcgacg ctca 16954 

<210> 44 



BASF NAE 878/03 



198/357 



<211> 
<2I2> 
<213> 

<220> 
<223> 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<400> 



16954 
DNA 

Artificial 
Plasmid 



misc_f eature 
(10264) . . (10264) 
n is a, c, g, or t 

mi sc_f eature 
(10472) . . (10472) 
n is a, c, g, or t 

mi sc_f eature 
(10563) . . (10563) 
n is a, c, g, or t 



ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 



aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 



120 



aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 



180 



ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 



240 



cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 



300 



caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 



360 



gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 



420 



tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 



480 



ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 



540 
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tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc .gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 
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ggccaacgcc cat aargcgg gctgttgccc ggcatccaac ocean tca~g gccacarcaa I860 

toattttctg giccctaccg ggt tgagaag cggtgraagt gaactgcagr tgccatgttt 1320 

tacggcagtg agagcagaga tagegctgat gtccggcggt gettttgecg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac .agaagecact ggagcaccrc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgrggtttca 2100 

aaateggetc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggra tttaaggttt tagaatgeaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcacegg aattgaaaaa actgatcgaa aaataccget gegtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgargcta tggctggaag gaaagctgee tgttccaaag gtcctgcact ttgaaeggea 2520 

tgatggctgg ageaatctge tcatgagtga ggccgatggc gtcctttget eggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gaeatategg attgtcccta tacgaatagc ttagacagee gettagcega 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgegaaaact gggaagaaga 2760 

cactccattt aaagatcege gcgagctgta tgatttttta aagacggaaa ageccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag eggcagggeg gacaagtggt atgacattgc 2940 

cttctgcgtc eggtcgatea gggaggatat eggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 
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attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagrgtrtt: ggctctcagg ccgaggccca cggcaagtat trgggcaagg 3180 

ggtcgctggt antcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggtttrccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtt tgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 
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ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagcrg gaaaccttcc 4380 

gcctcatgrc cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4 440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa ac'gctagggc cttgtggggt cagttccggc t.gggggttca gcagccagcg 4 560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 6S0 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggv: cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 552C 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 
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ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacactcgc aacttcgggt ccccgacctg taccattcgg tcagcaatgg ataggggagr 5700 

rgatatcgtc aacgrtcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgr cacggtraag 5820 

cgagaaatga aiaagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5380 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 
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cgcccagcct 
gctcattaac 
gtacaccaac 
aaga uccacc 
aacagcttic 
aagaactgga 
cggtgtcaag 
tagcatagct 
tcctttggct 
tattcaccaa 
.tcgaaataca 
gcactattga 
tagcagt t cc 
tctctagtta 
gtaaggattt 
atttaattct 
cgtatagcta 
ctttaagcct 
tgtccagatc 
cctagctgat 
accatttgta 



cgtcctgtca 
acccagitca 
gctcttctga 
tcttcgggaa 
tcagccaggg 
ggggtggtgt 
aggatattct 
tcatttggat 
ttcaatattc 
tgagcattgg 
tccggatgtc 
tcatccgata 
aggtggaatg 
gctcaccaca 
cggcacggct 
atttgtgttt 
ccagtgagga 
aaactataca 
atggttgacc 
tctggagtga 
gaaaaatgtg 



caactaccaa 
tgggcgttgg 
tccagtcgat 
gccagcgact 
ccagcccaag 
caaggaggag 
tcgactctgt 
ttgctttcca 
tcaagtatct 
aatgaacatg 
gaaggcttgg 
gctctgcaaa 
ttatgat gag 
aaagtcagac 
acggaagacg 
gatcgagacc 
agtggactca 
gaataagata 
ggtgcctgga 
cccagagggt 
acgaactcgt 



catcgactac 
catgatggcc 
catccgctga 
ggtgacctcc 
accgacaagg 
taagctcctt 
attatagata 
ggctgagact 
cgagtttgaa 
aatctgagga 
ggcacctgcg 
gggcgttgca 
cattgtatta 
ggcgtaacca 
gagaagccac 
taatacagcc 
aatcgacttc 
ggtggagagc 
tcttcctata 
catgacttga 
gagctctgta 



gataagggcc 
gtcatgcatc 
aggcgctttc 
agcgtccct t 
cctccctcca 
attgaagtcg 
agatgatgag 
ctagcttgga 
cttattccct 
ctgcaatcgc 
ttggttgaat 
caatgcaagt 
aatcaggaga 
aaagtcacac 
cttcagtgga 
cctacaacga 
agcaacatct 
ttataccgag 
gaatcatcct 
gcctaaaatc 
cagtgaccgg 



agttccccca 

tctacttcaa 

caatctggtt 

taaggctgcc 

gaacgccgag 

gaggacggag 

gaattggagg 

gcatagaggg 

gtgaaccttt 

catgaggttt 

ttagaacgtg 

caaacgttgc 

tatagcatga 

aacacaagct 

ctcgagtacc 

ccatcaaagt 

cctggataaa 

ctcccaaatc 

tattcgttga 

cgccgcctcc 

tgactctttc 



6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 
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tggcatgcgg aqagacccac ggacgcagag agaagggctg agtaataagc cactggccag S160 

acacctctcc ccgctctgag gtgcagtgga tgattattaa tccgggaccg cccccccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcticatcga arcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgiaagctc cctaattggc ccatccggca tctgtagggc gtccaaatar cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc . ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac ■ atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9.000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 
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accgcaagga atcggtcaat acactacang gcgtgatttc atatgcgcga tigctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcagcc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggar trcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgacig 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 97S0 ■ 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 
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taacaccc=c tacgccggcc gaaacttttt tacaactctc ctargagtcg tttacccaga 10680 

atccacaggt acacttgttt agaggtaatc cttctttcta cctacaaczc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt agagataaaa 10800 

taaaaagaca agaaaagaaa gtttgtacaa tttctttttg tttatataac atacacgcta I0S60 

tgtcaacart tagaataagg gggaaaaaat cttccatcat attcgaatgc acaagattat 10920 

ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca aagatactgt 10980 

catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc gaattagtaa 11040 

aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa agactaaatg 11100 

aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga tataaacttg 11160 

cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg tataaaaaaa 11220 

aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat tgcttgtata 11280 

tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta tttataaaaa 11340 

gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttccttatct attttaatct 11400 

catgttgtgg atttggaatg ccctgatcgt tttcgttacc gtgattggca tggaagtgat 11460 

tgctgcactg gcacacaaat acatcatgca cggctggggt tggggatggc atctttcaca 11520 

tcatgaaccg cgtaaaggtg cgtttgaagt taacgatctt tatgccgtgg tttttgctgc 11580 

attatcgatc ctgctgattt atctgggcag tacaggaatg tggccgctcc agtggattgg 11640 

cgcaggtatg acggcgtatg gattactcta ttttatggtg cacgacgggc tggtgcatca 11700 

acgttggcca ttccgctata ttccacgcaa gggctacctc aaacggttgt atatggcgca 11760 

ccgtatgcat cacgccgtca ggggcaaaga aggttgtgtt tcttttggct tcctctatgc 11820 

gccgcccctg tcaaaacttc aggcgacgct ccgggaaaga catggcgcta gagcgggcgc 11880 
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tgccagagat gcgcagggcg gggaggatga gcccgcatcc gggaagtaag cgccngacca 11940 

gaggcggcca gcagcagcgt taatttttcg ggcgtggtcg ttgactgccg ctgatcccaa 12000 

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaati 12060 

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagrgagc 12120 

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 121S0 

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggccaaag 12240 

acaaaagggc gacattcaac cgattgaggg agggaaggta aatattgacg gaaattattc 12300 

attaaaggtg aattatcacc gtcaccgact tgagccattt gggaattaga gccagcaaaa 12360 

tcaccagtag caccattacc attagcaagg ccggaaacgt caccaatgaa accatcgata 12420 

gcagcaccgt aatcagtagc gacagaatca agtttgcctt tagcgtcaga ctgtagcgcg 12480 

ttttcatcgg cattttcggt catagccccc ttattagcgt ttgccatctt ttcataatca 12540 

aaatcaccgg aaccagagcc accaccggaa ccgcctccct cagagccgcc accctcagaa 12600 

ccgccaccct cagagccacc accctcagag ccgccaccag aaccaccacc agagccgccg 12660 

ccagcattga caggaggccc gatctagtaa catagatgac accgcgcgcg ataatttatc 12720 

ctagtttgcg cgctatattt tgttttctat cgcgtattaa atgtataatt gcgggactct 12780 

aatcataaaa acccatctca taaataacgt catgcattac atgttaatta ttacatgctt ■ 12840 

aacgtaattc aacagaaatt atatgataat catcgcaaga ccggcaacag gattcaatct 12900 

taagaaactt tattgccaaa tgtttgaacg atcggggatc atccgggtct gtggcgggaa 12960 

ctccacgaaa atatccgaac gcagcaagat atcgcggtgc atctcggtct tgcctgggca 13020 

gtcgccgccg acgccgttga tgtggacgcc gggcccgatc atattgtcgc tcaggatcgt 13080 

ggcgttgtgc ttgtcggccg ttgctgtcgt aatgatatcg gcaccttcga ccgcctgttc 13140 
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cgcagagatc ccgtgggcga agaactccag catgagatcc ccgcgctgga ggatcatcca 13200 

gccggcgtcc cggaaaacga ttccgaagcc caaccttrca tagaaggcgg cggrggaatc 13260 

gaaatctcgt gatggcaggt tgggcgtcgc ttggtcggtc atttcgaacc ccagagiccc 13320 

gctcagaaga acicgccaag aaggcgatag aaggcgatgc gctgcgaacc gggagcggcg 13380 

ataccgtaaa gcacgaggaa gcggtcagcc cattcgccgc caagctcttc agcaatatca 13440 

cgggtagcca acgctatgtc ctgatagcgg tccgccacac ccagccggcc acagtcgatg 13500 

aatccagaaa agcggccatt ttccaccatg atattcggca agcaggcatc gccatgggtc 13560 

acgacgagat catcgccgtc gggcatgcgc gccttgagcc tggcgaacag ttcggctggc 13620 

gcgagcccct gatgctcttc gtccagatca tcctgatcga caagaccggc ttccatccga 13680 

gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca 13740 

agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 13800 

tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 13860 

tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 13920 

cgcgctgcct cgtcctgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 13980 

accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 14040 

tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 14100 

ccatcttgtt caatcatgcg aaacgatcca gatccggtgc agattatttg gattgagagt 14160 

gaatatgaga ctctaattgg atacc.gaggg gaatttatgg aacgtcagtg gagcattttt 14220 

gacaagaaat atttgctagc tgatagtgac cttaggcgac ttttgaacgc gcaataatgg 14280 

tttctgacgt atgtgcttag ctcattaaac tccagaaacc cgcggctgag tggctccttc 14340 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 14400 
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tcataacgtc actcccttaa ttctccgctc atgatcagat tgtcgtttcc ccccttcagt 14460 

ttaaactatc agtgtrtgac aggatatatt ggcgggtaaa cctaagacaa aacaccgttt 14520 

attagaaiaa tcggatattt aaaagggcgt gaaaaggttt atccgttcgt ccatt tctat 14580 

gtgcatgcca accacagggt tccccagatc tggcgccggc cagcgagacg agcaacatig 14640 

gccgccgccc gaaacgatcc gacagcgcgc ccagcacagg tgcgcaggca aattgcacca 14700 

acgcatacag cgccagcaga atgccatagt gggcggtgac gtcgttcgag tgaaccagat 14760 

cgcgcaggag gcccggcagc accggcataa tcaggccgat gccgacagcg tcgagcgcga 14820 

cagtgctcag aattacgatc aggggtatgt tgggtttcac gtctggcctc cggaccagcc 14880 

tccgctggtc cgattgaacg cgcggattct ttatcactga taagttggtg gacatattat 14940 

gtttatcagt gataaagtgt caagcatgac aaagttgcag ccgaatacag tgatccgtgc 15000 

cgccctggac ctgttgaacg aggtcggcgt agacggtctg acgacacgca aactggcgga 15060 

acggttgggg gttcagcagc cggcgcttta ctggcacttc aggaacaagc gggcgctgct 15120 

cgacgcactg gccgaagcca tgctggcgga gaatcatacg cattcggtgc cgagagccga 15180 

cgacgactgg cgctcatttc tgatcgggaa tgcccgcagc ttcaggcagg cgctgctcgc 15240 

ctaccgcgat ggcgcgcgca tccatgccgg cacgcgaccg ggcgcaccgc agatggaaac 15300 

ggccgacgcg cagcttcgct tcctctgcga ggcgggtttt tcggccgggg acgccgtcaa 15360 

tgcgctgatg acaatcagct acttcactgt tggggccgtg cttgaggagc aggccggcga 15420 

cagcgatgcc ggcgagcgcg gcggcaccgt tgaacaggct ccgctctcgc cgctgttgcg 15480 

ggccgcgata gacgccttcg acgaagccgg tccggacgca gcgttcgagc agggactcgc 15540 

ggtgattgtc gatggattgg cgaaaaggag gctcgttgtc aggaacgttg aaggaccgag 15600 

aaagggtgac gattgatcag gaccgctgcc ggagcgcaac ccactcacta cagcagagcc 15660 
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atgiagacaa catcccctcc ccctttccac cgcgtcagac qcccgtagca gcccgctacg 15720 

gcctrtttca tgccctcccc tagcgtccaa gcctcacggc cgcgctcggc czczctqqcc 15780 

gccttctggc gcccttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 15340 

■ cggcgagcgc tarcagcica ctcaaaggcg gtaatacggt tatccacaga atcaggggat 15900 

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 15960 

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 16020 

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 16080 

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 16140 

ctcccttcgg gaagcgtggc gcttttccgc tgcataaccc tgcttcgggg tcattatagc 16200 

gattttttcg gtatatccat cctttttcgc acgatataca ggattttgcc aaagggttcg 16260 

tgtagacttt ccttggtgta tccaacggcg tcagccgggc aggataggtg aagtaggccc 16320 

acccgcgagc gggtgttcct tcttcactgt cccttattcg cacctggcgg tgctcaacgg 16380 

gaatcctgct ctgcgaggct ggccggctac cgccggcgta acagatgagg gcaagcggat 16440 

ggctgatgaa accaagccaa ccaggaaggg cagcccacct atcaaggtgt actgccttcc 16500 

agacgaacga agagcgattg aggaaaaggc ggcggcggcc ggcatgagcc tgtcggccta 16560 

cctgctggcc gtcggccagg gctacaaaat cacgggcgtc gtggactatg agcacgtccg 16620 

cgagctggcc cgcatcaatg gcgacctggg ccgcctgggc ggcctgctga aactctggct 16680 

caccgacgac ccgcgcacgg cgcggttcgg tgatgccacg atcctcgccc tgctggcgaa 16740 

gatcgaagag aagcaggacg agcttggcaa ggtcatgatg ggcgtggtcc gcccgagggc 16800 

agagccatga cttttttagc cgctaaaacg gccggggggt gcgcgtgatt gccaagcacg 16860 

tccccatgcg ctccatcaag aagagcgact tcqcggagct ggtgaagtac atcaccgacq 16920 
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212/357 



agcaagccaa gaccgagcgc ctttgcgacg ctca 16954 

<2I0> 45 

<211> I949I 

<212> DNA 

<2I3> Artificial 

<220> 

<223> Piasmid 
<220> 

<221> misc_feature 

<222> (18970) . . (18970) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (19178) . . (19178) 

<223> n is a, c, g, or t 

<220> 

<221> misc^feature 

<222> (19269) . . (19269) 

<223> n is a, c, g, or t 

<400> 45 

agcttggtac cgagctcgga tccactagta acggccgcca gtgtgctgga attcgccctt 60 

gacggccagt gaattcgagc tcggtacccg gggatctttc gacactgaaa tacgtcgagc 120 

ctgctccgct tggaagcggc gaggagcctc gtcctgtcac aactaccaac atggagtacg 180 

ataagggcca gttccgccag ctcattaaga gccagttcat gggcgttggc atgatggccg 240 

tcatgcatct gtacttcaag tacaccaacg ctcttctgat ccagtcgatc atccgctgaa 300 

ggcgctttcg aatctggtta agatccacgt cttcgggaag ccagcgactg gtgacctcca 360 



gcgtcccttt aaggctgcca acagctttct cagccagggc cagcccaaga ccgacaaggc 420 
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ctccctccag aacgccgaga agaactggag gggtggtgtc aaggaggagt aagctcctta 480 

ttgaagtccc aggacggagc ggtgtcaaga ggatattctt ccactctgta ttatagaraa 540 

gatgargagc aattggagct agcatagctt catttggatt tgctttccag gctgagactc 600 

■cagcttggag catagagggt cctttggctt tcaatattct caagtatctc gagttigaac 660 

ttattccctg tgaacctttt attcaccaat gagcattgga atgaacatga atctgaggac 720 

tgcaatcgcc atgaggtttt cgaaatacat ccggatgtcg aaggcttggg gcacctgcgt 780 

tggttgaatt tagaacgtgg cactattgat catccgatag ctctgcaaag ggcgttgcac 840 

aatgcaagtc aaacgttgct agcagttcca ggtggaatgt tatgatgagc attgtattaa 900 

atcaggagat atagcatgat ctctagttag ctcaccacaa aagtcagacg gcgtaaccaa 960 

aagtcacaca acacaagctg taaggatttc ggcacggcta cggaagacgg agaagccacc 1020 

ttcagtggac tcgagtacca tttaattcta tttgtgtttg atcgagacct aatacagccc 1080 

ctacaacgac catcaaagtc gtatagctac cagtgaggaa gtggactcaa atcgacttca 1140 

gcaacatctc ctggataaac tttaagccta aactatacag aataagatag gtggagagct 1200 

tataccgagc tcccaaatct gtccagatca tggttgaccg gtgcctggat cttcctatag 1260 

aatcatcctt attcgttgac ctagctgatt ctggagtgac ccagagggtc atgacttgag 1320 

cctaaaatcc gccgcctcca ccatttgtag aaaaatgtga cgaactcgtg agctctgtac 1380 

agtgaccggt gactctttct ggcatgcgga gagacggacg gacgcagaga gaagggctga 1440 

gtaataagcc actggccaga cagctctggc ggctctgagg tgcagtggat gattattaat 1500 

ccgggaccgg ccgcccctcc gccccgaagt ggaaaggctg gtgtgcccct cgttgaccaa 1560 

gaatctattg catcatcgga gaatatggag cttcatcgaa tcaccggcag taagcgaagg 1620 

agaatgtgaa gccaggggtg tatagccgtc ggcgaaatag catgccatta acctaggtac 1680 
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agaagtccaa ttgcttccga tctggtaaaa gattcacgag atagtaccrt crccgaagta 1740 

gctagagcga ciaccccccg cgtaagctcc ctaattggcc catccggcat crctagggcg 1800 

tccaaatatc gtgcctctcc tgctttgccc ggtgtatgaa accggaaagg ccgctcagga 1860 

gctggccagc ggcgcagacc gggaacacaa gctggcagtc gacccatccg gtgctcticca 1920 

ctcgacctgc tgaggtccct cagtccctgg taggcagctt tgccccgtct gtccgcccgg 1980 

tgtgtcggcg gggttgacaa ggtcgttgcg tcagtccaac atttgttgcc atattttcct 2040 

gctctcccca ccagctgctc ttttcttttc tctttctttt cccatcttca gtatattcat 2100 

cttcccatcc aagaaccttt atttccccta agtaagtact ttgctacatc catactccat 2160 

ccttcccatc ccttattcct ttgaaccttt cagttcgagc tttcccactt catcgcagct 2220 

tgactaacag ctaccccgct tgagcagaca tcaccatgct gtcgaagctg cagtcaatca 2280 

gcgtcaaggc ccgccgcgtt gaactagccc gcgacatcac gcggcccaaa gtctgcctgc 2340 

atgctcagcg gtgctcgtta gttcggctgc gagtggcagc accacagaca gaggaggcgc 2400 

tgggaaccgt gcaggctgcc ggcgcgggcg atgagcacag cgccgatgta gcactccagc 24 60 

agcttgaccg ggctatcgca gagcgtcgtg cccggcgcaa acgggagcag ctgtcatacc 2520 

aggctgccgc cattgcagca tcaattggcg tgtcaggcat tgccatcttc gccacctacc 2580 

tgagatttgc catgcacatg accgtgggcg gcgcagtgcc atggggtgaa gtggctggca 2640 

ctctcctctt ggtggttggt ggcgcgctcg gcatggagat gtatgcccgc tatgcacaca 2700 

aagccatctg gcatgagtcg cctctgggct ggctgctgca caagagccac cacacacctc 2760 

gcactggacc ctttgaagcc aacgacttgt ttgcaatcat caatggactg cccgccatgc 2820 

tcctgtgtac ctttggcttc tggctgccca acgtcctggg ggcggcctgc tttggagcgg 2880 

ggctgggcat cacgctatac ggcatggcat atatgtttgt acacgatggc ctggtgcaca 2940 
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ggcgctttcc caccggcccc atcgctggcc tgccctacat gaagcgccrg acactggccc 3000 

accacciaca ccacagcggc aactacggtg gcgcgccctg gcctatgttc trcggrccac 3060 

aggagctgca gcacattcca ggtgcggcgg aggaggtgga gcgactggtc ctggaactgg 3120 

actggtccaa gcggtagggt gcggaaccag gcacgctggt ttcacacctc atccctgtga 3180 

taaggtgtgg ctagagcgat gcgtgtgaga cgggtatgtc acggtcgact ggtcrgatgg 3240 

ccaatggcat cggccatgtc tggtcatcac gggctggttg cctgggtgaa ggrgatgcac 3300 

atcatcatgt gcggttggag gggctggcac agtgtgggct gaactggagc agttgtccag 3360 

gctggcgttg aatcagtgag ggtttgtgat tggcggttgt gaagcaatga ctccgcccat 3420 

attctatttg tgggagctga gatgatggca tgcttgggat gtgcatggat catggtagtg 3480 

cagcaaacta tattcaccta gggctgttgg taggatcagg tgaggccttg cacattgcat 3540 

gatgtactcg tcatggtgtg ttggtgagag gatggatgtg gatggatgtg tattctcaga 3600 

cgtagacctt gactggaggc ttgatcgaga gagtgggccg tattctttga gaggggaggc 3660 

tcgtgccaga aatggtgagt ggatgactgt gacgctgtac attgcaggca ggtgagatgc 3720 

actgtctcga ttgtaaaata cattcagatg caagcttggc gtaatcatgg tcatagctgt 3780 

ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 3840 

agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 3900 

tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 3960 

cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag 4020 

ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga 4080 

cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa 4140 

ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat 4200 
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caagtttgcc tttagcgtca gactgtagcg cgttttcat c ggcattttcc gtcatagccc 4260 

ccitatiacc gtttcccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg 4320 

aaccgccrcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag 4380 

agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagi 4440 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 4500 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 4560 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 4620 

atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 4680 

cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 4740 

atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg 4800 

ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc cgttgctgtc 4860 • 

gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc 4920 

agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag 4980 

cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc 5040 

gcttggtcgg • tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat 5.100 

agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg aagcggtcag 5160 

cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc 5220 

ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca 5280 

tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc 5340 

gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat 5400 

catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg 5460 
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cttggtggtc gaat cggcac gtagccggat caagcgtatg cagccgccgc attgcatcag 5520 

ccatgatgga cactttctcg gcaggagcaa ggrgagatga caggagatcc tgccccggca 5580 

cttcgcccaa lagcagccag tcccttcccg cttcagtgac aacgtcgagc acacctqcgc 5640 

aaggaacgcc cgtcgrggcc agccacgata gccgcgctgc ctcgtcctgc acttcattca 5700 

gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga 5760 

acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg aatagcctct 5820 

ccacccaagc* ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc 5880 

cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag 5940 

gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta gctgatagtg 6000 

accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt agctcattaa 6060 

actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg 6120 

taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt aattctccgc 6180 

tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata 6240 

ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc 6300 

gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga 6360 

tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc 6420 

gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata 6480 

gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcaccggcat 6540 

aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat 6600 

gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt 6660 

ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg 6720 
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acaaagttgc agccgaatac actgatccgt gccgccctgg acctgtrgaa cgaggrcggc 6780 

gcagacggtc rgacgacacg caaacrggcg gaacggttgg gggtrcagca gcccgcgctt 6340 

tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggcg 6900 

gagaatcata cgcatrcggt gccgagagcc gacgacgact ggcgctcatr tcngarcggg 6960 

aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg catccatgcc 7020 

ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg cttcctctgc 7080 

gaggcgggtc tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacitcact 7140 

gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc 7200 

gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt cgacgaagcc 7260 

ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt ggcgaaaagg 7320 

aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg 7380 

ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc 7440 

accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc 7500 

aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc 7560 

tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7620 

cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7680 

gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7740 

gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7800 

gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7860 

ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc 7920 

gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc atcctttttc 7980 
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gcacgaiata caggatttcg ccaaagggrt cgtgtagact ttccttggtg tatccaacgg 8040 

cgtcagccgc gcaggatagg tgaagtaggc ccacccgcca gcgggtgtrc cttctticact S100 

gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggcr S160 

accgccggcg raacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag 8220 

ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag 8280 

gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca gggctacaaa 8340 

atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg 8400 

ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc 8460 

ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc 8520 

aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta gccgctaaaa 8580 
cggccggggg gtgcgcgtga ttgccaagca cgtccccatg egctccatca agaagagcga • 8640 

cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga 8700 

cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc cctgcaaacg 8760 

cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt tgtggatacc 8820 

tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact tgaggggccg 8880 

actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg gcgacgtgga 8940 

gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc ccacagatga 9000 

tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc gcgactactg 9060 

acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga tgaggggcgc 9120 

acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc aagggtttcc 9180 

gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca atatttataa 9240 
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acctigtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg aaggggggtg 9300 

ccccccctic tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc ccacgcgcic 9360 

cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt ccttgccatt 9420 

gccgggatcg gcgcagtaac gggatgggcg atcagcccga gcgcgacgcc cggaagcatt 9480 

gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag tgagggcggc 9540 

ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga cttcatggcg 9600 

gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc cgtgctcgtg 9660 

ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt ataccgaggt 9720 

atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat ttaaaaagct 9780 

accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat attgacaata 9840 

ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga tttcaggggg 9900 

caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca taaaaacttg 9960 

catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt ctatcataat 10020 

tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc gatgactttg 10080 

tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg tgccaggtgc 10140 

tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct gattacgtgc 10200 

agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca tatcaccacg 10260 

tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg ttcaccgaat 10320 

acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca gcgctggcgc 10380 

gatrtagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat gacgtcactg 10440 

cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga cgtaaaatcg 10500 
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tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca ttcatgccca 10560 

tatcaatgat tctctggtgc gtaccgcgtt gagaagcggt gtaagtgaac cgcagttgcc 10620 

atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt ttgccgtirac 10680 

gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa gccactggag 10740 

cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc cataattgtg 10800 

gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac aactttgaaa 10860 

aagctgtttr ctggtattta aggttttaga atgcaaggaa cagtgaattg gagttcgtct 10920 

tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa ggaaataata 10980 

aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat accgctgcgt 11040 

aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag aaaatgaaaa 11100 

cctatattta- aaaatgacgg acagccggta taaagggacc acctatgatg tggaacggga 11160 

aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc tgcactttga 11220 

acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc tttgctcgga 11280 

agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg agtgcatcag 11340 

gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag acagccgctt 11400 

agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg aaaactggga 11460 

agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga cggaaaagcc 11520 

cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct ttgtgaaaga 11580 

tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca agtggtatga 11640 

cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt atgtcgagct 11700 

attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt atattttact 11760 
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ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag cacgagccca 11S20 

ccgacttctn ccgcaicaag tgttttggct ctcaggccga ggcccacggc aagtatttgg 11880 

gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac gagaaggacg 11940 

gccagacggt ctacgggacc gacttcattg ccgataaggt ggatratctg gacaccaagg 12000 

caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc ggggcaatcc 12060 

cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa gaactgatcg 12120 

acgcggggtt trccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc atgcgtgcgc 12180 

cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc aagatcgagc 12240 

gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc gtggagcgtt 12300 

cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc gacacgcgag 12360 

gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa caggtcagcg 12420 

aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa atgcagcttt 12480 

ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac gacacggccc 12540 

gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg caaaacaagg 12600 

tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag ctgcgggccg 12660 

acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc cctatcggcg 12720 

agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg atcaatggcc 12780 

ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg atgggcttca 12840 

cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc cgcgtcctgg 12900 

accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc gtcgtgctgt 12960 

ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg tcgccgacgg 13020 
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ccccacggat gttcgactat ttcagctccc accgggagcc gtaccccctc aagctcgaaa I30SO 

cctrccgcct catctgcgga tcggattcca cccgcgtgaa gaagtggcgc gagcaggtcg 13140 

gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg gtcaatgatg 13200 

acctggugca ttgcaaacgc tagggccttg tggggtcagt tccggctggg ggttcagcag 13260 

ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact tgcttcgctc 13320 

agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag gattaaaatt 13380 

gacaatrgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc aggatttccg 13440 

cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg tttacgagca 13500 

cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg tggcattcgg 13560 

cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg acggccccaa 13620 

ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc gaggccgagg 13680 

ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga tgatcgtccg 13740 

acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac ttaatatttc 13800 

gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg tcgcggcgac 13860 

ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc taggtagccc 13920 

gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg cgctgttggt 13980 

gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg cgggggcggt 14040 

ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc ctctgctcac 14100 

ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag ctttagtgtt 14160 

tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt ggctcggcct 14220 

gatcggagcg ggt.ttaacct acttcctttg gttccggggg atctcgcgac tcgaacctac 14280 
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acttgrctcc 
tcaggccgac 
gggagttgat 
gcmiaiccc 
gttaagcgag 
tgatcacagg 
atccgtgttt 
gcaaagtctg 
ctgtatcgag 
gcaggatata 
cgtttttaat 
cccttcaccg 
aggcgaaaat 
aatagcccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
tcggtgcggg 
ttaagttggg 
ttcgagctcg 
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ttactgggct 
agt cggaact 
atcgtcaacg 
gcgat t tcct 
aaa~cgaataa 
cagcaacgct 
caaacccggc 
ccgccttaca 
tggtgatttt 
ttgtggtgta 
gtactggggt 
cctggccctg 
cctgtttgat 
gatagggt tg 
caacgtcaaa 
caaatcaagt 
cccccgattt 
agcgaaagga 
cctcttcgct 
taacgccagg 
gtacccgggg 



ttctcagccc 
tcgggtcccc 
ttcacttcta 
attatgtcgg 
gaaggctgat 
ctgtcatcgt 
agcttagttg 
acggctctcc 
gtgccgagct 
aacaaattga 
ggtttttctt 
agagagttgc 
ggtggttccg 
agtgttgttc 
gggcgaaaaa 
tttttggggt 
agagcttgac 
gcgggcgcca 
attacgccag 
gttttcccag 
atctttcgac 



cagatctggg 
gacctgtacc 
aagaaatagc 
catagttctc 
aattcggatc 
tacaatcaac 
ccgttcttcc 
cgctgacgcc 
gccggtcggg 
cgcttagaca 
ttcaccagtg 
agcaagcggt 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 
ttcaggctgc 
ctggcgaaag 
tcacgacgtt 
actgaaatac 



gtcgatcagc 
at tcggngag 
gccactcagc 
aagatcgaca 
tctgcgaggg 
atgctaccct 
gaatagcatc 
gtcccggact 
gagctgttgg 
acttaataac 
agacgggcaa 
ccacgctggt 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
gcaactgttg 
ggggatgtgc 
gtaaaacgac 
gtcgagcctg 



cggggatgca 

caatggatag 

ttcctcagcg 

gcccgtcacg 

agatgatatt 

ccgcgagatc 

ggtaacatga 

gatgggctgc 

ctggctggtg 

acat tgcgga 

cagctgattg 

ttgccccagc 

aaatcaaaag 

ctattaaaga 

ccactacgtg 

aatcggaacc 

gcgagaaagg 

ggaagggcga 

tgcaaggcga 

ggccagtgaa 

ctccgcttgg 



14340 

14400 

14 4 60 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 
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aacccccgac gaccctcgtc ctgtcacaac taccaacatc cegtacgata acgcccaci: 15600 

cccccagctc attaagagcc agttcatggg cgttggcatg atggccgtca tgcatctgta 15660 

cttcaagtac accaacgctc ttctgatcca gtcgatcatc cgctgaaggc gctttcgaat 15720 

ctggttaaga tccacgtctt cgggaagcca gcgacrggtg acctccagcg tccctitaag 157S0 

gctgccaaca gctttctcag ccagggccag cccaagaccg acaaggcctc cctccagaac 15840 

gccgagaaga actggagggg tggtgtcaag gaggagtaag ctccttattg aagtcggagg 15900 

acggagcggt' gtcaagagga tattcttcga ctctgtatta tagataagat gatgaggaat 15960 

tggaggtagc atagcttcat ttggatttgc tttccaggct gagactctag cttggagcat 16020 

agagggtcct ttggctttca atattctcaa gtatctcgag tttgaactta ttccctgtga 16080 

accttttatt caccaatgag cattggaatg aacatgaatc tgaggactgc aatcg'ccatg 16140 

aggttttcga aatacatccg gatgtcgaag gcttggggca cctgcgttgg ttgaatttag 16200 

aacgtggcac tattgatcat ccgatagctc tgcaaagggc gttgcacaat gcaagtcaaa 16260 

cgttgctagc agttccaggt ggaatgttat gatgagcatt gtattaaatc aggagatata 16320 

gcatgatctc tagttagctc accacaaaag tcagacggcg taaccaaaag tcacacaaca 16380 

caagctgtaa ggatttcggc acggctacgg aagacggaga agccaccttc agtggactcg 16440 

agtaccattt aattctattt gtgtttgatc gagacctaat acagccccta caacgaccat 16500 

caaagtcgta tagctaccag tgaggaagtg gactcaaatc gacttcagca acatctcctg 16560 

gataaacttt aagcctaaac tatacagaat aagataggtg gagagcttat accgagctcc 16620 

caaatctgtc cagatcatgg ttgaccggtg cctggatctt cctatagaat catccttatt 16680 

cgttgaccta gctgattctg gagtgaccca gagggtcatg acttgagcct aaaatccgcc 16740 

gcctccacca tttgtagaaa aatgtgacga actcgtgagc tctgtacagt gaccggtgac 16800 
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tctttctggc atcccgagag acggacggac gcagagagaa gggcrgagta ataagccact I6S60 

ggccagacag crctggcggc tctgaggtgc agtggatgat tattaatccg cqaccggccg 16920 

ccc'ctccgcc ccgaagtgga aaggctggtg tgcccctcgt tgaccaagaa tctat tgcat 16980 

catcggagaa tatggagctt catcgaatca ccggcagtaa gcgaaggaga atgtgaagcc 17040 

aggggtgtat agccgtcggc gaaatagcat gccattaacc taggtacaga agtccaattg 17100 

cttccgatct ggtaaaagat tcacgagata gtaccttctc cgaagtaggt agagcgagta 17160 

cccggcgcgt aagctcccta attggcccat ccggcatctg tagggcgtcc aaatatcgtg 17220 

cctctcctgc tttgcccggt gtatgaaacc ggaaaggccg ctcaggagct ggccagcggc 17280 

gcagaccggg aacacaagct ggcagtcgac ccatccggtg ctctgcactc gacctgctga 17340 

ggtccctcag tccctggtag gcagctttgc cccgtctgtc cgcccggtgt gtcggcgggg 17400 

ttgacaaggt cgttgcgtca gtccaacatt tgttgccata ttttcctgct ctccccacca 17460 

gctgctcttt tcttttctct ttcttttccc atcttcagta tattcatctt cccatccaag 17520 

aacctttatt tcccctaagt aagtactttg ctacatccat actccatcct tcccatccct 17580 

tattcctttg aacctttcag ttcgagcttt cccacttcat cgcagcttga ctaacagcta 17640 

ccccgcttga gcagacatca ccatgcctga actcaccgcg acgtctgtcg agaagtttct 17700 

gatcgaaaag ttcgacagcg tctccgacct gatgcagctc tcggagggcg aagaatctcg 17760 

tgctttcagc ttcgatgtag gagggcgtgg atatgtcctg cgggtaaata gctgcgccga 17820 

tggtttctac aaagatcgtt atgtttatcg gcactttgca tcggccgcgc tcccgattcc 17880 

ggaagtgctt gacattgggg aattcagcga gagcctgacc tattgcatct cccgccgtgc 17940 

acagggtgtc acgttgcaag acctgcctga aaccgaactg cccgctgttc tgcagccggt 18000 

cgcggaggcc atggatgcga tcgctgcggc cgatcttagc cagacgagcg ggttcggccc 18060 
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ancggaccg caaggaatcg gtcaatacac tacatggcgt ganttcatat gcgcgatcgc 18120 

tgatccccat c:ciaicact ggcaaactgt gatggacgac accgtcagtc cgtccgtcgc 1S1S0 

gcaggctctc gatgagctga tgctttgggc cgaggactgc cccgaagtcc ggcaccrcgt IS240 

gcacgcggat txcggctcca acaatgtcct gacggacaat ggccgcataa cagcggtcat 1S3C0 

tgactggagc gaggcgatgt tcggggattc ccaatacgag gtcgccaaca tcttcttctg 18360 

gaggccgtgg ttggcttgta tggagcagca gacgcgctac ttcgagcgga ggcatccgga 18420 

gcttgcagga- tcgccgcggc tccgggcgta tatgctccgc attggtcttg accaactcta 18480 

tcagagcttg gttgacggca atttcgatga tgcagcttgg gcgcagggtc gatgcgacgc 18540 

aatcgtccga tccggagccg ggactgtcgg gcgtacacaa atcgcccgca gaagcgcggc 18600 

cgtctggacc gatggctgtg tagaagtact cgccgatagt ggaaaccgac gccccagcac 18660 

tcgtccgagg gcaaaggaat agagtagatg ccgaccgcgg gatcgatcca cttaacgtta 18720 

ctgaaatcat caaacagctt gacgaatctg gatataagat cgttggtgtc gatgtcagct 18780 

ccggagttga gacaaatggt gttcaggatc tcgataagat acgttcattt gtccaagcag 18840 

caaagagtgc cttctagtga tttaatagct ccatgtcaac aagaataaaa cgcgttttcg 18900 

ggtttacctc ttccagatac agctcatctg caatgcatta atgcattgac tgcaacctag 18960 

taacgccttn caggctccgg cgaagagaag aatagcttag cagagctatt ttcattttcg 19020 

ggagacgaga tcaagcagat caacggtcgt caagagacct acgagactga ggaatccgct 19080 

cttggctcca cgcgactata tatttgtctc taattgtact ttgacatgct cctcttcttt 19140 

actctgatag cttgactatg aaaattccgt caccagcncc tgggttcgca aagataattg 19200 

catgtttctt ccttgaactc tcaagcctac aggacacaca ttcatcgtag gtataaacct 19260 

cgaaatcant tcctactaag atggtataca atagtaacca tgcatggttg cctagtgaat 19320 



BASF NAE 878/03 
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gctccgtaac acccaatecg ccggccgaaa cttttttaca actctcccat gagtcgttta 193S0 
cccagaatgc acaggtacac ttgtttagag gtaatccttc tttctagcta gaagtcctcg 19440 
tgtactgtgt aagcgcccac tccacatctc cactcgacct gcaggcatgc a 194 91 

<210> 46 

<211> 21300 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
<220> 

<221> misc_feature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, g, or t 

<400> 46 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 
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gccagggcca gcccaacacc gacaaggcct ccctccagaa cgccgagaac aactcgagcg 360 

gtggtgtcaa gcaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 4S0 

tttggatttg cttrccaggc tgagactcta gcttggagca tagagggrcc Lttgccticc 540 

aatattctca "agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatigtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 
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cgaaatacca tgccattaac ctaggtacag aagtccaatt gcttccgatc tgctaaaaga 1620 

ttcacgagat actaccttct ccgaagtagg tagagcgact accccocqcc taagctccci 16S0 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg crttgcccgg 1740 

tgiatcaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

t 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 2460 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 
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tucggggarr cccaatacga ggtcgccaac atcttctcct ggaggccgtg gutgccttgt 2SS0 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg ccctigcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caarcgtccg atccggagcc 3060 

gggaccgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

' tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg - 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttgaa ttcgagctcg gtacccgggg 4020 

atctttcgac actgaaatac gtcgagcctg ctccgcttgg aagcggcgag gagcctcgtc 4080 
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ctgtcacaac taccaacatg gagtacgata agggccagtt ccgccagctc attaagaqcc 4140 

agttcatggg cgttggcatg atggccgtca tgcatctgta cttcaagtac accaacgctc 4200 

ttctgatcca gtcgatcatc cgctgaaggc gctttcgaat cnggttaaga tccacgtcrt 4260 

cgggaagcca gcgactggtg acctccagcg tccctttaag gctgccaaca gctttctcag 4320 

ccagggccag cccaagaccg acaaggcctc cctccagaac gccgagaaga actggagggg 4380 

tggtgtcaag gaggagtaag ctccttattg aagtcggagg acggagcggt gtcaagagga 4440 

tattcttcga ctctgtatta tagataagat gatgaggaat tggaggtagc atagcttcat 4500 

ttggatttgc tttccaggct gagactctag cttggagcat agagggtcct ttggctttca 4560 

atattctcaa gtatctcgag tttgaactta ttccctgtga accttttatt caccaatgag 4620 

cattggaatg aacatgaatc tgaggactgc aatcgccatg aggttttcga aatacatccg 4680 

gatgtcgaag gcttggggca cctgcgttgg ttgaatttag aacgtggcac tattgatcat 4740 

ccgatagctc tgcaaagggc gttgcacaat gcaagtcaaa cgttgctagc agttccaggt 4800 

ggaatgttat gatgagcatt gtattaaatc aggagatata gcatgatctc tagttagct'c 4860 

accacaaaag tcagacggcg taaccaaaag tcacacaaca caagctgtaa ggatttcggc 4920 

acggctacgg -aagacggaga agccaccttc agtggactcg agtaccattt aattctattt 4980 

gtgtttgatc gagacctaat acagccccta caacgaccat caaagtcgta tagctaccag 5040 

tgaggaagtg gactcaaatc gacttcagca acatctcctg gataaacttt aagcctaaac 5100 

tatacagaat aagataggtg gagagcttat accgagctcc caaatctgtc cagatcatgg 5160 

ttgaccggtg cctggatctt cctatagaat catccttatt cgttgaccta gctgattctg 5220 

gagtgaccca gagggtcatg acttgagcct aaaatccgcc gcctccacca tttgtagaaa 5280 

aatgtgacga actcgtgagc tctgtacagt gaccggtgac tctttctggc atgcggagag 5340 
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acqgacgcac gcacagagaa gggctgagta ataagccact ggccagacag ctctggcgcc 5400 

tctgaggtgc agiggaigai tattaatccg ggaccggccc cccctccqcc ccgaagrgga 5460 

aaggctggtg rgcccctcgt tgaccaagaa tctattgcat catcggagaa tatggagctt 5520 

catcgaatca ccggcagtaa gcgaaggaga atgtgaagcc aggggtgtat agccgtcggc 5530 

gaaatagcat gccattaacc taggtacaga agtccaattg cttccgatct ggtaaaagat 5640 

tcacgagata gtaccttctc cgaagtaggt agagcgagta cccggcgcgt aagct cccta 5700 

attggcccat - ccggcatctg tagggcgtcc aaatatcgtg cctctcctgc tttgcccggt 5760 

gtatgaaacc ggaaaggccg ctcaggagct ggccagcggc gcagaccggg aacacaagct 5820 

ggcagtcgac ccatccggtg ctctgcactc gacctgctga ggtccctcag tccctggtag 5880 

gcagctttgc cccgtctgtc cgcccggtgt gtcggcgggg ttgacaaggt cgttgcgtca 5940 

gtccaacatt tgttgccata ttttcctgct ctccccacca gctgctcttt tcttttctct 6000 

ttcttttccc atcttcagta tattcatctt cccatccaag aacctttatt tcccctaagt 6060 

aagtactttg ctacatccat actccatcct tcccatccct tattcctttg aacctttcag 6120 

ttcgagcttt cccacttcat cgcagcttga ctaacagcta ccccgcttga gcagacatca 6180 

ccatgtcaat actcacttat ctggaatttc atctctacta tacactacct gtccttgcgg 6240 

cattgtgttg gctgctaaag ccgtttcact cacagcaaga caatctcaag tataaatttt 6300 

taatgttgat ggccgcctct accgcatcga tttgggacaa ttatatcgtt tatcatcgcg 6360 

cttggtggta ctgtcctact tgtgttgtgg ctgtcattgg ctatgtacct ctagaagaat 6420 

acatgttctt tatcatcatg actttaatga ctgtcgcgtt ctcaaacttt gttatgcgtt 6480 

ggcacttgca tactttcttt attagaccca acacttcttg gaagcaaaca ctattagtac 6540 

gccttgtgcc tgtttcagct ttattggcaa tcacttatca tgcttggcac ttgacactgc 6600 
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caaataaacc ttcattttan ggttcatgca tcctttggta tgcttgtcct gtgttggcta 6660 

ttctttggct gggtgctggc gaatatatct tgcgtcgacc tctggctgtc cttttgtcra 6720 

ttgttatccc tagtgtatac ctatgttggg ctgatatcgt cgctattagt gctggcacat 6780 

ggcatatttc tcttagaaca agcactggca aaatggtagt acccgattta cctgtagaag 6840 

aatgcctgtt ttttactttg atcaacacag tcttggtttt tgctacctgt gctatagacc 6900 

gcgctcaggc catcctccat gtgagcgcgc gtaatacgac tcactatagg gcgaattgga 6960 

gctccaccgc ggtggcggcc gctctagaac tagtggatcc cccgggctgc aggaattcgg 7020 

cacgagctac atttcacaag cccgtgagcg gtgcaagcgc tctgccccac atcggcccac 7080 

ctcctcatct ccatcggtca tttgctgcta ccacgatgct gtcgaagctg cagtcaatca 7140 

gcgtcaaggc ccgccgcgtt gaactagccc gcgacatcac gcggcccaaa gtctgcctgc 7200 

atgctcagcg gtgctcgtta gttcggctgc gagtggcagc accacagaca gaggaggcgc 7260 

tgggaaccgt gcaggctgcc ggcgcgggcg atgagcacag cgccgatgta gcactccagc 7320 

agcttgaccg ggctatcgca gagcgtcgtg cccggcgcaa acgggagcag ctgtcatacc 7380 

aggctgccgc cattgcagca tcaattggcg tgtcaggcat tgccatcttc gccacctacc 7440 

tgagatttgc catgcacatg accgtgggcg gcgcagtgcc atggggtgaa gtggctggca 7.500 

ctctcctctt ggtggttggt ggcgcgctcg gcatggagat gtatgcccgc tatgcacaca 7560 

aagccatctg gcatgagtcg cctctgggct ggctgctgca caagagccac cacacacctc 7620 

gcactggacc ctttgaagcc aacgacttgt ttgcaatcat caatggactg cccgccatgc 7680 

tcctgtgtac ctttggcttc tggctgccca acgtcctggg ggcggcctgc tttggagcgg 7740 

ggctgggcat cacgctatac ggcatggcat atatgtttgt acacgatggc ctggtgcaca 7800 

ggcgctttcc caccgggccc atcgctggcc tgccctacat gaagcgcctg acagtggccc 7860 
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accagctaca ccacagcggc aagtacggtc ccqccccczg gggtatgttc tLcggtccac 7920 

aggagctgca gcacattcca ggtgcggcgg aggaggtgga gcgactggtc crggaactgg 79S0 

actggtccaa gcgggctcag gccatccrcc atctgtacaa atcatctgtr caaaaicaaa 8040 

accctaaaca agccatttcc cttttccagc atgtcaaaga gctagcatgg gcctrctgtc 8100 

ttcctgacca aatgctcaac aatgaattgt ttgatgatct tactatcagc tgggatattt 8160 

tacgtaaagc ctcaaagtca ttctatactg catctgccgt ttttccaagt tatgtacgtc 8220 

aagacttggg tgttctctat gctttctgca gagctaccga tgaccrgtgc gatgatgaat 8280 

ccaaatct-gt tcaagaaaga agagaccaat tagatcttac tcgacaattt gttcgtgatc . 8340 

tctttagcca aaagaccagt gcgcctattg tgattgattg ggaattgtat caaaaccaac 8400 

ttcctgcttc ttgtatatca gcctttagag cctttactcg ccttcgccat gtccttgaag 8460 

tagaccctgt agaagaacta ttagatggtt acaaatggga tcttgagcgt cgtcctatcc 8520 

ttgatgaaca agacttggag gcatactctg cttgtgtggc cagtagtgtg ggtgaaatgt 8580 

gcacacgtgt gattcttgct caagaccaaa aggaaaatga tgcttggata attgaccgtg 8640 

cacgtgagat ggggctggtg ctacaatacg ttaacattgc tcgagacatt gtgactgata 8700 

gcgagactct gggtcgatgt tatctgcctc aacaatggct tagaaaagaa gaaacagaac 8760 

aaatacagca aggcaacgcc cgtagcctag gtgatcaaag actgttgggc ttgtctctga 8820 

agcttgtagg aaaggcagac gctatcatgg tgagagctaa gaagggcatt gacaagttgc 8880 

cggcaaactg tcaaggcggt gtacgagctg cttgccaagt atatgctgca attggatctg 8940 

tactcaagca gcagaagaca acatatccta caagagctca tctaaaagga agcgaacgtg 9000 

ccaagattgc tctgttgagt gtatacaacc tctatcaatc tgaagacaag cctgtggctc 9060 

tccgtcaagc tagaaagatt aagagttttt ttgttgatta gtgaattttt gttttattta 9120 
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tgt ctcatag ttcaaraaac agacaacaca tacaatataa aatcattcic ttuaaatgtt SISO 

aatttactag agtgtaaagc ctgcattttt tttgtacgca taaacaatga gttcaccccg 9240 

cttctggttt ttaaataatt atgtcaaact agggaaaatt cttttttttc tcttcgttcc 9300 

ttttttggct tgttgtggag tcacaggctt gtcttcagat tgatagaggt tgtatacact 9360 

caacagagca atcttggcac gttcgcttcc ttttagatga gctcttgtag gatatgttgt 9420 

cttctgctgc ttgagtacag atccaattgc agcatatact tggcaagcag ctcgtacacc 9480 

gccttgacag tttgccggca acttgtcaat gcccttctta gctctcacca tgatagcgtc 9540 

tgcctttcct acaagcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta 9600 

tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc 9660 

ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg 9720 

aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 9780 

tattgggcca aagacaaaag ggcgacattc aaccgattga gggagggaag gtaaatattg 9840 

acggaaatta ttcattaaag gtgaattatc accgtcaccg acttgagcca tttgggaatt 9900 

agagccagca aaatcaccag tagcaccatt accattagca aggccggaaa cgtcaccaat 9960 

gaaaccatcg atagcagcac cgtaatcagt agcgacagaa tcaagtttgc ctttagcgtc 10020 

agactgtagc gcgttttcat cggcattttc ggtcatagcc cccttattag cgtttgccat 10080 

cttttcataa tcaaaatcac cggaaccaga gccaccaccg gaaccgcctc cctcagagcc 10140 

gccaccctca gaaccgccac cctcagagcc accaccctca gagccgccac cagaaccacc 10200 

accagagccg ccgccagcat tgacaggagg cccgatctag taacatagat gacaccgcgc 10260 

gcgataattt atcctagttt gcgcgctata ttttgttttc tatcgcgtat taaatgtata 10320 

attgcgggac tctaatcata aaaacccatc tcataaataa cgtcatgcat tacatgttaa 10380 
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tt attacatc 
cagcattcaa 
tctgtggcgg 
tcttgcctgg 
cgctcaggat 
cgaccgcctg 
ggaggatcat 
cggcggtgga 
accccagagt 
atcgggagcg 
ttcagcaata 
gccacagtcg 
atcgccatgg 
cagttcggct 
ggcttccatc 
ggtagccgga 
ggcaggagca 
gtcccttccc 
cagccacgat 
cttgacaaaa 
gccgattgtc 



cttaacgtaa 
cCiiaagacc 
gaactccacg 
gcagtcgccg 
cgtggcgttg 
ttccgcagag 
ccagccggcg 
atcgaaatct 
cccgctcaga 
gcgataccgt 
tcacgggtag 
atgaatccag 
gtcacgacga 
ggcgcgagcc 
cgagtacgtg 
tcaagcgtat 
aggtgagatg 
gcttcagtga 
agccgcgctg 
agaaccgggc 
tgttgtgccc 



ttcaacagaa 
ctttattgcc 
aaaatatccg 
ccgacgccgt 
tgcttgtcgg 
atcccgtggg 
tcccggaaaa 
cgtgatggca 
agaactcgtc 
aaagcacgag 
ccaacgctat 
aaaagcggcc 
gatcatcgcc 
cctgatgct c 
ctcgctcgat 
gcagccgccg 
acaggagat c 
caacgtcgag 
cctcgtcctg 
gcccctgcgc 
agtcatagcc 



attazaigai 
aaa"cg"ct"cga 
aacgcagcaa 
tgatgtggac 
ccgttgctgt 
cgaagaactc 
cgattccgaa 
ggttgggcgt 
aagaaggcga 
gaagcggtca 
gtcctgatag 
attttccacc 
gtcgggcatg 
ttcgtccaga 
gcgatgtttc 
cattgcatca 
ctgccccggc 
cacagctgcg 
cagttcattc 
tgacagccgg 
gaatagcctc 



aatcatcgca 
acqa t cgcgg 
gatatcgcgg 
gccgggcccg 
cgtaatgata 
cagcatgaga 
gcccaacctt 
cgcttggtcg 
tagaaggcga 
gcccattcgc 
cggtccgcca 
atgatattcg 
cgcgccttga 
tcatcctgat 
gcttggtggt 
gccatgatgg 
acttcgccca 
caaggaacgc 
agggcaccgg 
aacacggcgg 
tccacccaag 



agaccgccaa 
a teat cegge 
tgcat ci egg 
at can attgt 
tcggcacctt 
tccccgcgct 
tcatai gaagg 
gtcat ttcga 
tgegctgega 
cgcca'agctc 
cacccagccg 
gcaacjcaggc 
gcctcjgcgaa 
cgaca agacc 
cgaat gggca 
atact ttctc 
atagcageca 
ccgt cgtggc 
acagcjtcggt 
catcagagca 
eggceggaga 



10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 
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acctgcgrgc aatccarctt gttcaatcat gcgaaacgat ccagatccgg ngcacattat 11700 

ttggattgag agtgaatatg agactctaat tggataccga ggggaattta tggaacgtca 11760 

gtggagcatt tttgacaaga aatatttgct agctgatagt gaccttaggc gacttttgaa 11820 

cgcgcaaiaa tggtttctga cgtatgtgct tagctcatta aactccagaa acccgcggct 11880 

gagtggctcc ttcaacgttg cggttctgtc agttccaaac gtaaaacggc ttgtcccgcg 11940 

tcatcggcgg gggtcataac gtgactccct taattctccg ctcatgatca gattgtcgtt 12000 

tcccgccttc agtttaaact atcagtgttt gacaggatat attggcgggt aaacctaaga 12060 

gaaaagagcg tttattagaa taatcggata tttaaaaggg cgtgaaaagg tttatccgtt 12120 

cgtccatttg tatgtgcatg ccaaccacag ggttccccag atctggcgcc ggccagcgag 12180 

acgagcaaga ttggccgccg cccgaaacga tccgacagcg cgcccagcac aggtgcgcag 12240 

gcaaattgca ccaacgcata cagcgccagc agaatgccat agtgggcggt gacgtcgttc 12300 

gagtgaacca gatcgcgcag gaggcccggc agcaccggca taatcaggcc gatgccgaca 12360 

gcgtcgagcg cgacagtgct cagaattacg atcaggggta tgttgggttt cacgtctggc 12420 

ctccggacca gcctccgctg gtccgattga acgcgcggat tctttatcac tgataagttg 12480 

gtggacatat tatgtttatc agtgataaag tgtcaagcat gacaaagttg cagccgaata 12-540 

cagtgatccg tgccgccctg gacctgttga acgaggtcgg cgtagacggt ctgacgacac 12600 

gcaaactggc ggaacggttg ggggttcagc agccggcgct ttactggcac ttcaggaaca 12660 

agcgggcgct gctcgacgca ctggccgaag ccatgctggc ggagaatcat acgcattcgg 12720 

tgccgagagc cgacgacgac tggcgctcat ttctgatcgg gaatgcccgc agcttcaggc 12780 

aggcgctgct cgcctaccgc gatggcgcgc gcatccatgc cggcacgcga ccgggcgcac 12840 

cgcagatgga aacggccgac gcgcagcttc gcttcctctg cgaggcgggt ttttcggccg 12900 
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gggacgccgt caatgcgctg atgacaatca gctacttcac tgttgcggcc gtgcttgagg 12960 

agcaggccgg cgacagcgat gccggcgagc gcggcggcac ccttgaacag cctccgctct 13020 

cgccgctgrt gcgggccgcg atagacgcct tcgacgaagc cggtccggac gcagcgttcg 13080 

agcagggact cgcggtgatt gtcgatggat tggcgaaaag gaggctcgct gtcaggaacg 13140 

ttgaaggacc gagaaagggt gacgattgat caggaccgct gccggagcgc aacccactca 13200 

ctacagcaga gccatgtaga caacatcccc tccccctttc caccgcgtca gacgcccgta 13260 

gcagcccgct acgggctttt tcatgccctg ccctagcgtc caagcctcac ggccgcgctc 13320 

ggcctctctg gcggccttct ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 13380 

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 13440 

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 13500 

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 13560 

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 13620 

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 13680 

cctgtccgcc tttctccctt cgggaagcgt ggcgcttttc cgctgcataa ccctgcttcg 13740 

gggtcattat agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt 13800 

gccaaagggt tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag 13860 

gtgaagtagg cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg 13920 

cggtgctcaa cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg 13980 

agggcaagcg gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg 14040 

tgtactgcct tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga 14100 

gcctgtcggc ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact 14160 
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atgagcacgt ccgccagctg gcccgcatca atggcgacct gcgccgccrg ggcggccrgc 14220 

tcaaactcrg cctcaccgac gacccgcgca cggcgcggtt cggtgargcc acgatcctcg 14280 

ccctgctggc gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg 14340 

tccgcccgag ggcagagcca tgactttttt agccgctaaa acggccgggg ggrgcgcgtg 14400 

attgccaagc acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag 14460 

tacatcaccg acgagcaagg caagaccgag cgcctttgcg acgctcaccg ggctggttgc 14520 

cctcgccgct gggctggcgg ccgtctatgg ccctgcaaac gcgccagaaa cgccgtcgaa 14580 

gccgtgtgcg agacaccgcg gccgccggcg ttgtggatac ctcgcggaaa acttggccct 14 640 

cactgacaga tgaggggcgg acgttgacac ttgaggggcc gactcacccg gcgcggcgtt 14700 

gacagatgag gggcaggctc gatttcggcc ggcgacgtgg agctggccag cctcgcaaat 14760 

eggcgaaaac gcctgatttt acgcgagttt cccacagatg atgtggacaa gcctggggat 14820 ■ 

aagtgccctg cggtattgac acttgagggg cgcgactact gacagatgag gggcgcgatc 14880 

cttgacactt gaggggcaga gtgctgacag atgaggggcg cacctattga catttgaggg 14940 

gctgtccaca ggcagaaaat ccagcatttg caagggtttc cgcccgtttt tcggccaccg 15000 

ctaacctgtc ttttaacctg cttttaaacc aatatttata aaccttgttt ttaaccaggg 15060 

ctgcgccctg tgcgcgtgac cgcgcacgcc gaaggggggt gccccccctt ctcgaaccct 15120 

cccggcccgc taacgcgggc ctcccatccc cccaggggct gcgcccctcg gccgcgaacg 15180 

gcctcacccc aaaaatggca gcgctggcag tccttgccat tgccgggatc ggggcagtaa 15240 

cgggatgggc gatcagcccg agcgcgacgc ccggaagcat tgacgtgccg caggtgctgg 15300 

catcgacatt cagcgaccag gtgccgggca gtgagggcgg cggcctgggt ggcggcctgc 15360 

ccttcacttc ggccgtcggg gcattcacgg acttcatggc ggggccggca atttttacct 15420 
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tgggcattct nggcatagtg gtcgcgggtg ccgtgctcgt gttcgggggt ccgataaacc 15480 

cagccaaccc trtgaggtga taggtaagat tataccgagg tatgaaaacc agaartggac 15540 

ctttacagaa ttactctatg aagcgccata tttaaaaagc taccaagacg aagaggarga 15600 

agaggatgag gaggcagatt gccttgaata tattgacaat actgataaga laatataicc 15660 

tttanataga agatatcgcc gtatgtaagg atttcagggg gcaaggcata ggcagcgcgc 15720 

ttatcaatat atctatagaa tgggcaaagc ataaaaactt gcatggacta atgcttgaaa 15780 

cccaggacaa taaccttata gcttgtaaat tctatcataa ttgggtaatg actccaactt 15840 

attgatagtg ttttatgttc agataatgcc cgatgacttt gtcatgcagc tccaccgatt . 15900 

ttgagaacga cagcgacttc cgtcccagcc gtgccaggtg ctgcctcaga ttcaggttat 15960 

gccgctcaat tcgctgcgta tatcgcttgc ' tgattacgtg cagctttccc ttcaggcggg 16020 

attcatacag cggccagcca tccgtcatcc atatcaccac gtcaaagggt gacagcaggc 16080 

tcataagacg ccccag-cgtc gccatagtgc gttcaccgaa tacgtgcgca acaaccgtct 16140 

tccggagact gtcatacgcg taaaacagcc agcgctggcg cgatttagcc ccgacatagc 16200 

cccactgttc gtccatttcc gcgcagacga tgacgtcact gcccggctgt atgcgcgagg 16260 

ttaccgactg cggcctgagt tttttaagtg acgtaaaatc gtgttgaggc caacgcccat 16320 

aatgcgggct gttgcccggc atccaacgcc attcatggcc atatcaatga ttttctggtg 16380 

cgtaccgggt tgagaagcgg tgtaagtgaa ctgcagttgc catgttttac ggcagtgaga 16440 

gcagagatag cgctgatgtc cggcggtgct tttgccgtta cgcaccaccc cgtcagtagc 16500 

tgaacaggag ggacagctga tagacacaga agccactgga gcacctcaaa aacaccatca 16560 

tacactaaat cagtaagttg gcagcatcac ccataattgt ggtttcaaaa tcggctccgt 16620 

cgatactatg ttatacgcca actttgaaaa caactttgaa aaagctgttt tctggtattt 16680 
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ggatgccgaa accatcgcaa gccgcaccgt 
cgtcggctcc arggtccagc aagctacggc 
tcccccugcc ctgcccgcgc catcggccgc 
ggcggcaggt t rggcgaagt cgaigaccat 
gcgaaaaacc gccggcgagg acctggcaaa 
gc'tgaaacac acgaagcagc agatcaagga 
gtggccggac acgatgcgag cgatgccaaa 
gcgcaacaag aaaatcccgc gcgaggcgct 
ggacgtgaag atcacctaca ccggcgtcga 
gcagcaggtg ttggagtacg cgaagcgcac 
ctacgagctt tgccaggacc tgggctggtc 
ggaatgcctg tcgcgcctac aggcgacggc 
cctggaatcg gtgtcgctgc tgcaccgctt 
ccgttgccag gtcctgatcg acgaggaaat 
gaaattcata tgggagaagt accgcaagct 
tttcagctcg caccgggagc cgtacccgct 
atcggattcc acccgcgtga agaagtggcg 
gcgaggcagc ggcctggtgg aacacgcctg 
ctagggcctt gtggggtcag ttccggctgg 
tcaggaacaa gcgggcactg ctcgacgcac 
ggcgcgctct acgaactgcc gataaacaga 
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catgcgcgcg ccccgcgaaa ccctccagrc 18000 

caagatcgag ccccacagcg tgcaactccc 18060 

cgtggagcgt tcgcgtcgtc tcgaacagga 18120 

cgacacgcga ggaactatga cgaccaagaa 18 ISO 

acaggtcagc gaggccaagc aggccgcgtt 18240 

aatgcagctt tccttgttcg atattgcgcc 18300 

cgacacggcc cgctctgccc tgttcaccac 18360 

gcaaaacaag gtcattttcc acgtcaacaa 18420 

gctgcgggcc gacgatgacg aactggtgtg 18480 

ccctatcggc gagccgatca ccttcacgtt 18540 

gatcaatggc cggtattaca cgaaggccga 18600 

gatgggcttc acgtccgacc gcgttgggca 18660 

ccgcgtcctg gaccgtggca agaaaacgtc 18720 

cgtcgtgctg tttgctggcg accactacac 18780 

gtcgccgacg gcccgacgga tgttcgacta 18840 

caagctggaa accttccgcc tcatgtgcgg 18900 

cgagcaggtc ggcgaagcct gcgaagagtt 18960 

ggtcaatgat gacctggtgc attgcaaacg 19020 

gggttcagca gccagcgctt tactggcatt 19080 

ttgcttcgct cagtatcgct cgggacgcac 19140 

ggattaaaat tgacaattgt gattaaggct 19200 
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cagattcgac ggcttggagc ggccgacgtg caggatttcc gcgagatccg attgtcggcc 19260 

ctgaagaaag ctccagagat gttcgggtcc gtttacgagc accaggagaa aaagcccacg IS320 

gaggcgttcg ctgaacggtt gcgagatgcc gtggcattcg gcgcctacat cgacggcgag 1S3S0 

atcattgggc tgtcggtctt caaacaggag gacggcccca aggacgcrca caaggcgcat 19440 

ctgtccggcg tt rtcgtgga gcccgaacag cgaggccgag gggtcgccgg tatgctgctg 19500 

cgggcgttgc cggcgggttt attgctcgtg atgatcgtcc gacagattcc aacgggaatc 19560 

tggtggatgc gcatcttcat cctcggcgca cttaatattt cgctattctg gagcttgttg 19620 

tttatttcgg tctaccgcct gccgggcggg gtcgcggcga cggtaggcgc tgtgcagccg 19680 

ctgatggtcg tgttcatctc tgccgctctg ctaggtagcc cgatacgatt gatggcggtc 19740 

ctgggggcta tttgcggaac tgcgggcgtg gcgctgttgg tgttgacacc aaacgcagcg 19800 

ctagatcctg tcggcgtcgc agcgggcctg gcgggggcgg tttccatggc gttcggaacc 19860 

gtgctgaccc gcaagtggca acctcccgtg cctctgctca cctttaccgc ctggcaactg 19920 

gcggccggag gacttctgct cgttccagta gctttagtgt ttgatccgcc aatcccgatg 19980 

cctacaggaa ccaatgttct cggcctggcg tggctcggcc tgatcggagc gggtttaacc 20040 

tacttccttt ggttccgggg gatctcgcga ctcgaaccta cagttgtttc cttactgggc 20100 

tttctcagcc ccagatctgg ggtcgatcag ccggggatgc atcaggccga cagtcggaac 20160 

ttcgggtccc cgacctgtac cattcggtga gcaatggata ggggagttga tatcgtcaac 20220 

gttcacttct aaagaaatag cgccactcag cttcctcagc ggctttatcc agcgatttcc 20280 

tattatgtcg gcatagttct caagatcgac agcctgtcac ggttaagcga gaaatgaata 20340 

agaaggctga taattcggat ctctgcgagg gagatgatat ttgatcacag gcagcaacgc 20400 

tctgtcatcg ttacaatcaa catgctaccc tccgcgagat catccgtgtt tcaaacccgg 20460 
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cagcttagtt gccgttcttc cgaatagcat cggtaacatg agcaaagtcr cccgccttac 20520 

aacgcctctc ccgctgacgc cgtcccggac tgatgggctg cctgtatcga grgg rgattt 20580 

tgtgccgagc tgccggtcgg ggagctgttg gctggctggt ggcaggatat attgrggtgc 20640 

aaacaaatrg acgcttagac aacttaataa cacattgcgg acgtttttaa rctactgggg 20700 

tggtttttct tttcaccagt gagacgggca acagctgatt gcccttcacc gcctggccct 20760 

gagagagttg cagcaagcgg tccacgctgg tttgccccag caggcgaaaa tcctgtttga 20820 

tggtggttcc gaaancggca aaatccctta taaatcaaaa gaatagcccg agatagggtt 20880 

gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa 20940 

agggcgaaaa accgtctatc agggcgatgg cccactacgt gaaccatcac ccaaatcaag 21000 

ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga gcccccgatt 21060 

tagagcttga cggggaaagc cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg 21120 

agcgggcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 21180 

tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag 21240 

ggttttccca gtcacgacgt tgtaaaacga cggccagtga attcgagctc ggtacccggg 21300 



<210> 47 

<211> 17756 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> miscjeature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 
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<220> 

<221> mis cofeature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 47 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



aa 



aa 



60 



acgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 
ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 
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gcaattriLc ccttgggcat tcttggcata gtggtcccgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattaraccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 
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utttctgcta titaaggttt tagaatgcaa gaaacagtca attggagttc gtcctgrtat 2220 

aaitagctrc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatcgc 22S0 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga aigictcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 
tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 
ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 
cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 
gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 
gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 
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ggttttccgc cgaggargcc gaaaccatcg caacccgcac cgtcatgcgt gcgccccgcg 34S0 

aaaccttcca ciccctcggc tcgatggtcc agcaacctac ggccaacatc gaccccqaca 3540 

gcgTigcaacr ggctccccct gccctgcccg cgccatcggc cgccgtggag cgtrcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgrggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt • 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 
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tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgatrgtcc gccctgaaga aagctccaga catgttcggg tccgtrtacc aqcacgagga 4S00 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgrggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggca atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg . cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 
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ctticaaacc cgccagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactqatggc cigcc^giai 6060 

cgagrggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatatcgrgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 61S0 

taatgtaccg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagaragg gttgagrgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 
ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 
agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 
cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 
tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 
ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 
cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 
gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 
gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 
aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 
aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 
aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 
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cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcaragct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagrttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga crgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 
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gcctaacctc cctaattggc ccatccggca tctgtacggc gtccaaatat cgugcctcic 8520 

ctgctttgcc cggrgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgaccig ctgaggtccc S640 

tcagtcccig graggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttigaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgt gcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacart ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

qtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 
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aggatcgccg cggctccggg cgtatatcct ccgcattggt ctigaccaac tctatcagag 9780 

crtcgtrcac gccaatttcg atgatgcagc ttgggcgcag ggtccatgcg acgcaatcgl: 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatcgc ngtgragaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt cattttgctt 10800 

tgtaaatttc tggtaactgc caccaagaaa tatgaggata ttcgtgatgt tcctcgtggt 10860 

agccaaaatg atagcacgtg ataaatgacc accaaatagg acggctaatt gtttgggcac 10920 

aatgaggctg aacataaccc cctattggtt cactatgggg taaaaaagta ccaaaataga 10980 
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ataattgtaa tcaacttaaa agcgagggta gcacccaaaa gtaagttaga tiarcacttc 11040 

gcatatcgag tatgnattta gcaaagttat aaataatagt caacgcaatt atttgccccc 11100 

aactccagta accrttcata aaatgaaaat accaagcaaa gaaactntgg rgtttaccat 11160 

tgtgaaaatc cgggtcratr gagcttgctg gattgtggtg gtgtaaccaa tgtttrttca 11220 

atagtttttg acatggtaaa agaccataaa gggatagggt caatgttcca atcaaargat 11280 

taatcttggt gttttgggga aatactacgc catgcatggc atcatgagat gtaataaata 11340 

atcccgtata taaaaatgit tgccatagta taacaggcaa taacatccaa aattttagct 11400 

ttgagatgtc aagggaaagt aataaactca ggctaatgac ccatgcgcta acaatgacaa 11460 

tagcaatgaa aagcccctta aactgagatt tacttctcag tactggagtc agttttgctt 11520 

gatgactgag tggttgttct aactggatca tttctaaaga gaaggtggaa caatgttagc 11580 

ataattgtgc ttgagtgagg actttgaggg taggtacata cttgataaag ttaatgatta 11640 

aagagaaaaa aaaagttttg gttcaaagca gaaattgttt tttaaatcga ttggtgagaa 11700 

aatttttttc tgtttccgca tcaccaaagc cacctcagga atggtcacaa attattggtc 11760 

tgattggacc ataagcatac aaaaagttca ttgaagtata cttagtggct tattagactt 11820 

ttatcgtttt ctaacgcgaa tcagcaatgt ttcttgtttg atttactgct tgctttagat 11880 

catttttgtc tgaaatatta tgcatttgtt caaagcggcc tttgtttcct ttctttcatg 11940 

cttaaacacg ttgtttattc catatattac tttgaatatg catcaccgca aagcggaagt 12000 

gcaaaataac aaagaacctc tttgggttac acgatcaact gctattgtga aaaaaatttc 12060 

tttttgaaaa tttttggaat aatatctctt gcaaaaaaga aattttgtat atttagtagc 12120 

atcaagaaca aatgaaagaa gtgtgggata acaagaatac atcatcttta gacaaaagta 12180 

cgagaaaaat ctaataagtt gttatagagg tctttgtttt ctttgtgttt atagacagtt 12240 
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atttagagtt tgaaaagtgt ctctaatgtg tcttttttta ttattattat itcaaatctt 12300 

atgtaatata gctaaagcta tagatttgac attttttcta aatataaaat ttcagucaac 12360 

agaaataaat gacacgagtt ctttttctct ctctcaatcc tgttgatcat caatctttga 12420 

tgtcgtttta aaacaaatga atggcattta gttccttagg tgtcacrcac atcttgttga 124S0 

ccagaaaatc cttattcgcc ctcaaatctg ctttattcct ttcatttgat ttgatgttta 12540 

agtaatgcaa gcaaacaaaa aagaaacctt tcttgcaaag acaaaagaat tgttttcaga 12600 

ggaaagcaac tcgttgtcat tttttaagga tttagactta taatcgacac catagtttgt 12660 

ccgttacatt ttttattgtc gttttctgat ttccttttaa tctttaagca aaatcaatat 12720 

taacttatct tgtcttccaa taaaaaatgg ataccaataa caataaatcc ttcacaaaga 12780 

aaaaaaaaaa aaactcgaaa aaagcttggc gtaatcatgg tcatagctgt ttcctgtgtg 12840 

aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc 12900 

ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt 12960 

ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg 13020 

cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag ggagggaagg 13080 

taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga cttgagccat 13140 

ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa ggccggaaac 13200 

gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat caagtttgcc 13260 

tttagcgtca gactgtagcg cgttttcatc ggcattttcg gtcatagccc ccttattagc 13320 

gtttgccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg aaccgcctcc 13380 

ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag agccgccacc 134 40 

agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagt aacatagatg 13500 
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acaccgcgcg cgataattta tcctagtttg cgcgctatar tttcttttct atcgcgtatt: 13560 

aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac gtcatgcatt 13620 

acatgttaat tatiacatgc ttaacgtaat tcaacagaaa ttatatgata atcatcgcaa 13680 

gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa cgatcgggga 13740 

tcatccgggt ctgrggcggg aactccacga aaatatccga acgcagcaag atatcgcggt 13800 

gcarctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg ccgggcccga 13860 

tcatattgtc gcrcaggatc gtggcgttgt gc.ttgtcggc cgttgctgtc gtaatgatat 13920 

cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc agcatgagat 13980 

ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag cccaaccttt 14040 

catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg 14100 

tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat 14160 

gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc 14220 

gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac 14280 

acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg 14340 

caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc gcgccttgag 14400 

cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc 14460 

gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc 14520 

gaatgggcag gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga 14580 

tactttctcg gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa 14640 

tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc 14700 

cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgqa 14760 
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caccLcggrc tccccaaaaa gaaccgggcg cccctgcgct gacacccgga acacggcggc 14S20 

atcagagcag ccgartgtct grtgtgccca gtcatagccc aataccctct ccacccaagc 14880 

ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc cagatccggt 14940 

gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag gggaatttat 15000 

ggaacgtcag tggagcattt ttgacaagaa atatttgcta gctgatagtg accttaggcg 15060 

acttttgaac gcgcaataat ggtttctgac gtatgtgctt agctcattaa actccagaaa 15120 

cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg taaaacggct 15180 

tgtcccgcgt catcggcggg ggtcataacg tgactccctt aattctccgc tcatgatcag 15240 

attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata ttggcgggta 15300 

aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc gtgaaaaggt 15360 

•ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga tctggcgccg 15420 

gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc gcccagcaca 15480 

ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata gtgggcggtg 15540 

acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcaccggcat aatcaggccg 15600 

atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat gttgggtttc 15660 

acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt ctttatcact 15720 

gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg acaaagttgc 15780 

agccgaatac agtgatccgt gccgccctgg acctgttgaa cgaggtcggc gtagacggtc 15840 

tgacgacacg caaactggcg gaacggttgg gggttcagca gccggcgctt tactggcact 15900 

tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggcg gagaatcata 15960 

cgcattcggt gccgagagcc gacgacgact ggcgctcatt tctgatcggg aatgcccgca 16020 
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ccrtcaggca ogcgctcctc gcctaccgcg atggcgcgcg catccatgcc ggcacgcgac I60S0 

cgggcgcacc gcagatggaa acggccgacg cgcagcitcg cttcctctgc gaggcgggtt 16140 

tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacttcact gtnggggccg 16200 

tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc gtigaacagg 16260 

ctccgctctc gccgcrgttg cgggccgcga tagacgcctt cgacgaagcc ggnccggacg 16320 

cagcgttcga gcagggactc gcggtgattg tcgatggatt ggcgaaaagg aggctcgttg 16380 

tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg ccggagcgca 16440 

acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc accgcgtcag 16500 

acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc aagcctcacg 16560 

gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc tcactgactc 16620 

gctgcgctcg gtcgttcggc tgcggcgagc ggt-atcagct cactcaaagg cggtaatacg 16680 

gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa 16740 

ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 16800 

cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 16860 

ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct 16.920 

taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc gctgcataac 16980 

cctgcttcgg ggtcattata gcgatttttt cggtatatcc atcctttttc gcacgatata 17040 

caggattttg ccaaagggtt cgtgtagact ttccttggtg tatccaacgg cgtcagccgg 17100 

gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc cttcttcact gtcccttatt 17160 

cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggct accgccggcg 17220 

taacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag ggcagcccac 17280 
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ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag gcggcggcgg 17340 

ccggcatgag cctgtcggcc tacctgctgg ccgrcggcca gggcracaaa atcacgggcg 17400 

tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg ggccgcctgg 17460 

gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc ggrgargcca 17520 

cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc aaggtcatga 17580 

tgggcgtggt ccgcccgagg gcagagccat gactttttta gccgctaaaa cggccggggg 17640 

gtgcgcgtga ttgccaagca cgtccccatg cgctccatca agaagagcga cttcgcggag 17700 

ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga cgctca 17756 
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ccgggctggt tgccctcccc gctgggctgg cggccgtcts tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gutgacagat gaggggcagg ctcgatrtcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgectgat tttacgegag tttcccacag atgatigtgga 300 

caagcctggg gataagtgee ctgeggtatt gacacttgag gggegegact actgacagat 360 

gaggggegeg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

tttteggeca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gecgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc - 660 

tcggccgcga acggcctcac cccaaaaatg geagegctgg cagtccttgc cattgeeggg 720 

ateggggcag taacgggatg ggegatcage ccgagcgcga cgcccggaag cattgaegtg 780 

ccgcaggtgc tggcatcgac attcagegae caggtgeegg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca eggacttcat ggcggggccg .900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagegee atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgecttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 
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craatgcttg aaacccagga caataacctt ataccttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcstgc 13S0 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccac grgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgcrgat t ac gtgcaccttt 150C 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc . 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgt tat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 
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tgatggctgg agccatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 25S0 

tgaagaicaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcacgctcti 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgaectggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa rcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3.420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 
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tcgarattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3S40 

ccctgttcac cecgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcartt 390C 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg caccccratc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaagcc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg ■ 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc ' cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 
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iiccaacggga atctggtgga tgcgcatctt catcctcggc ccacrtaata tttcgctatt 510C 

crggagcrtg ttgrttattt cggrctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgctgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg tacoattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 
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aaarcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagaragg gttgagtgtt gttccagttt ggaacaagag tccaciatta aagaacgtcg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaarc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aacccraaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 
tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 
gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 
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tagcagitcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 76S0 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaatrct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catga.cttga gcctaaaatc cgccgcct.cc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc ■ 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 
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tatttcccct aagtaagtac uttgctacat ccatactcca tccitcccat cccttattcc SSSO 

tutcaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gcraccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct ' tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 



BASF NAE 878/03 269/357 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttctccaa gcagcaaaca 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacccgtt ttcgggrtca 10200 

cctcttccag aracagctca tctgcaatgc attaatgcat tgactgcaac ctagraaccc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac ta'tatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc ccct tattct aaatgttgac atagcgtgta 11340 
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tgttacataa 
tat gat ccac 
taaatctcag 
cctgagttta 
tatactatgg 
tggcgtagta 
cctttatggt 
tccagcaagc 
gtattttcat 
ttataacttt 
gctaccctcg 
tgaaccaata 
gtggtcatt t 
tatttcttgg 
ggtcatagct 
ccggaagcat 
cgttgcgct c 
tcggccaacg 
caaccgattg 
caccgtcacc 
taccattagc 
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acaaaaagaa 
1 1 agaacaac 
tttaaggggc 
t t actttccc 
caaacatttt 
tttccccaaa 
cttttaccat 
t caat agacc 
t ttat gaaag 
gctaaataca 
cttttaagtt 
gggggttatg 
atcacgtgct 
tggcagttac 
gtttcctgtg 
aaagtgtaaa 
actgcccgct 
cgcggggaga 
agggagggaa 
gacttgagcc 
aaggccggaa 



attgtacaaa 
cactcagtca 
ttttcattgc 
ttgacatctc 
tatatacggg 
acaccaagat 
atcaaaaact 
cggattttca 
gttactggag 
tactccatat 
cattacaatt 
ttcagcctca 
atcattttgg 
cagaaatt ta 
tgaaattgtt 
gcctggggtg 
ttccagtcgg 
ggcggtttgc 
ggtaaatatt 
atttgggaat 
acgtcaccaa 



ctttctttrc 
tcaagcaaaa 
tattgtcatt 
aaagctaaaa 
attatttatt 
taatcatttg 
attgaaaaaa 
caatggtaaa 
ttgggggcaa 
cccaagtgat 
attctatttt 
ttgtgcccaa 
ctaccacgag 
caaagcaaaa 
atccgctcac 
cctaatgagt 
gaaacctgtc 
gtattgggcc 
gacggaaatt 
tagagccagc 
tgaaaccatc 



ttctcttttt 

ctgactccag 

gttagcgcat 

ttttggatgt 

acatctcatg 

attggaacat 

cattggttac 

caccaaagtt 

ataattgcgt 

aatctaactt 

ggtacttttt 

acaattagcc 

gaacatcacg 

tagaagcttg 

aattccacac 

gagctaactc 

gtgccagctg 

aaagacaaaa 

attcattaaa 

aaaatcacca 

gatagcagca 



attttatctc 

t actgagaag 

gggtcatcag 

tattgcctgt 

atgccatgca 

tgaccctatc 

accaccacaa 

tctttgcttg 

tgactattat 

acttttgggt 

taccccatag 

gtcctatttg 

aatatcctca 

gcgtaatcat 

aacatacgag 

acattaattg 

cattaatgaa 

gggcgacatt 

ggtgaattat 

gtagcaccat 

ccgtaatcag 



11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 
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tagcgecaga arcaagtttg cctttagcgt cagactgtag cgcgttttca tcggcattrt 12660 

cgctcatagc ccccttatta gcgtttgcca tcttttcata atcaaaatca ccggaaccag 12720 

agccaccacc ggaaccgcct ccctcagagc cgccaccctc agaaccgcca ccctcagagc 12780 

caccaccctc agagccgcca ccagaaccac caccagagcc gccgccagca ttgacaggag 12840 

gcccgatcta gtaacataga tgacaccgcg cgcgataatt tatcctagtt tgcgcgctat 12900 

attttgtttt ctatcgcgta ttaaatgtat aattgcggga ctctaatcat aaaaacccat 12960 

ctcataaata acgtcatgca ttacatgtta attattacat gcttaacgta attcaacaga 13020 

aattatatga taatcatcgc aagaccggca acaggattca atcttaagaa actttattgc 13080 

caaatgtttg aacgatcggg gatcatccgg gtctgtggcg ggaactccac gaaaatatcc 13140 

gaacgcagca agatatcgcg gtgcatctcg gtcttgcctg ggcagtcgcc gccgacgccg 13200 

ttgatgtgga cgccgggccc gatcatattg tcgetcagga tcgtggcgtt gtgcttgtcg 13260 

gccgttgctg tcgtaatgat atcggcacct tcgaccgcct gttccgcaga gatcccgtgg 13320 

gcgaagaact ccagcatgag atccccgcgc tggaggatca tccagccggc gtcccggaaa 13380 

acgattccga agcccaacct ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc 13440 

aggttgggcg tcgcttggtc ggtcatttcg aaccccagag tcccgctcag aagaactcgt 13500 

caagaaggcg atagaaggcg atgcgctgcg aatcgggagc ggcgataccg taaagcacga 13560 

ggaagcggtc agcccattcg ccgccaagct cttcagcaat atcacgggta gccaacgcta 13620 

tgtcctgata gcggtccgcc acacccagcc ggccacagtc gatgaatcca gaaaagcggc 13680 

cattttccac catgatattc ggcaagcagg catcgccatg ggtcacgacg agatcatcgc 13740 

cgtcgggcat gcgcgccttg agcctggcga acagttcggc tggcgcgagc ccctgatgct 13800 

cttcgtccag atcatcctga tcgacaagac cggcttccat ccgagtacgt gctcgctcga 13860 
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tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc 13920 

gcattgcatc agccatgatg gatactttct cggcaggagc aaggtgagat gacacgagat I39S0 

cctgccccgg cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 14040 

gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 14100 

gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 14160 

ctgacagccg gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 14220 

cgaatagcct ctccacccaa gcggccggag aacctgcgtg caatccatct tgttcaatca 14280 

tgcgaaacga tccagatccg gtgcagatta tttggattga gagtgaatat gagactct.aa 14340 

ttggataccg aggggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14400 

tagctgatag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14460 

ttagctcatt aaactccaga aacccgcggc tgagtggctc cttcaacgtt gcggttctgt 14520 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14580 

ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14640 

tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14700 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat gccaaccaca 14760 

gggttcccca gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg 14820 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 14880 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 14940 

cagcaccggc ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15000 

gatcaggggt atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg 15060 

aacgcgcgga ttctttatea ctgataagtt ggtggacata ttatgtttat cagtgataaa 15120 
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gtgtcaagca tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgrtg 15180 

aacgaggtcg ccgtagacgg tctgacgaca cgcaaactgg cggaacggtt gcgggtrcag 15240 

cagccggcgc tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa 15300 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15360 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15420 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 15480 

cgcttcctct gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc 15540 

agctacttca ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag 15600 

cgcggcggca ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc 15660 

ttcgacgaag ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga 15720 

ttggcgaaaa ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga 15780 

tcaggaccgc tgccggagcg caacccactc actacagcag agccatgtag acaacatccc 15840 

ctcccccttt ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct 15900 

gccctagcgt ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt 15960 

ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16020 

ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16080 

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16140 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16200 

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 16260 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16320 

tggcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat 16380 
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ccatcctttt tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg 16440 

tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc cagcgggtgt 16500 

tccttcttca crgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga 16560 

ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag 16620 

ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg 16680 

attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc 16740 

cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc 16800 

aatggcgacc tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc 16860 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 16920 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 16980 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17040 

caagaagagc gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga 17100 

gcgcctttgc gacgctca 17118 
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<400> 49 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 
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tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 • 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgtt gcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 
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ctctccgacc tgatgcagct ctcggaggcc gaagaatctc grgetttcag cttcgatcta 2280 

ggagggcgtg gatatgtcci gcgggtaaat agctgcgccg atggttrcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgatic cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgrg cacagggtgt cacgttgcaa 2460 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 
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gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacgctcc tcaacagacc tacgagactg aggaatccgc tcttggctcc accccactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

t 

gatggratac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caaagcttga gattaaaata gataaggaaa 4020 

agaaagtgaa aagaaattcg gaagcatqgc acattcttct ttttataaat acatgcctga 4080 

ctttcttttt ccatcgatat gatatatgca tatgatagat atacaagcaa tcttcttcaa 4140 

ggagtttgaa attttgtcct ccaggagcaa aaaaaagttt ttttttatac atgtttgtac 4200 

acaagaatag ttaccaattt gctttggtct tacgtgctgc aagtttatat cgttttcaat 4260 

ttctttgtct ttacattttc tttgtccttt atctttcctc atttagtctt tgggagaatt 4320 

aggaaaaggg agcggaaagg taagaaatgc ttgcgtattt tactaattcg gcaaacatcc 4380 

aatttggcaa acagcagcct gtgcaacgct ctcgagatga cagtatcttt gattacactc 4440 

taaatctcga tgacccgacc aaaaagagcg aacaaagaaa taatcttgtg cattcgaata 4500 

tgatggaaga ttttttcccc cttattctaa atgttgacat agcgtgtatg ttatataaac 4560 

aaaaagaaat tgtacaaact ttcttttctt ctctttttat tttatctcta tgctgtcgaa 4620 

gctgcagtca atcagcgtca aggcccgccg cgttgaacta gcccgcgaca tcacgcggcc 4680 

caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 
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gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg ggcgatgagc acagcgccga 4300 

tgtagcactc cagcagcttg accgggcuat cgcagagcgt cgtgcccggc gcaaacggga 4360 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgrgtcag gcatigccai 4920 

citccccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag tgccatgggg 4980 

tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgtacacga 5280 

tggcctggtg cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat • 54 00 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcgggc gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 

cgcccgtagc ctaggtgatc aaagactgtt gggcttgtct ctgaagcttg taggaaaggc 5640 

agacgctatc atggtgagag ctaagaaggg cattgacaag ttgccggcaa actgtcaagg 5700 

cggtgtacga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatat cctacaagag ctcatctaaa aggaagcgaa cgtgccaaga ttgctctgtt 5820 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 

aaagagacaa cacatacaat ataaaatcat tgtctttaaa tgttaattta gtagagtgta 6000 
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aaccctgcat tttttttgta cgcataaaca atgaattcac cccgcttctg gtttttaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttrtt ggcttgttgt 6120 

ggagtcacag gcttgtcttc agattgatag aggttgtata cactcaacag agcaatcttg 6180 

gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcrtgagt 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 

cagcacatac aaaaaaaaag aaatttaaga tgagtaggac ttccattctc tcaaaaattt 6600 

tattcaatcc ataaatgaat tatttttgga caaaaaagaa agattatgcc tgattt tctc 6660 

tatttttttt ttttttacaa ctccaccaat actttctagc ccagcttggc gtaatcatgg 6720 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 6780 

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 6840 

ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 6900 

ggccaacgcg cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca 6960 

accgattgag ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca 7020 

ccgtcaccga cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta 7080 

ccattagcaa ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta 7140 

gcgacagaat caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg 7200 

gtcatagccc ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag 7260 
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ccaccaccgg aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcacagcca 7320 

ccaccctcag aaccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc 73S0 

ccgatctagt aacatagatg acaccgcgcg cgataattta tcctagttcg cgcgctatat 7440 

tttgrtttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct 7500 

cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa 7560 

ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca 7620 

aatgtttgaa cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga 7680 

acgcagcaag atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt 7740 

gatgtggacg ccgggcccga tcatattgtc gctcaggatc • gtggcgttgt gcttgtcggc 7800 

cgttgctgtc gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaagaactoc agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac 7920 

gattccgaag cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag 7980 

gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca 8040 

agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 8100 

aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 8160 

tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca 8220 
ttttccacca tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg 8280 
tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 8340 
tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 8400 
cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 8460 
attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 8520 
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tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc 8580 

acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 8640 

agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 8700 

gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 8760 

aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 8820 

cgaaacgatc cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt 8880 

ggataccgag gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta 8940 

gctgatagtg accttaggcg acttttgaec gcgcaataat ggtttctgac gtatgtgctt 9000 

agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 

ccgacagcgc gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca 9420 

gaatgccata gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 

cgaggtcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca 9780 
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gccgcccctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 9840 

catgcrggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgcrcatt 9900 

tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg I0C20 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 

cgacgaagcc ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 10440 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 10680 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 10800 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 10860 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 

tatccaacgg cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc 11040 
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cttcttcact gtcccttatt cgcacctggc ggtgctcaac gcgaarcctg ctctgcgagg II100 

ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 11160 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgrcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 11460 

cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta 11520 

gccgctaaaa cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca 11580 

agaagagcga cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc 11640 

gcctttgcga cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc 11700 

cctgcaaacg cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt 11760 

tgtggatacc tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact 11820 

tgaggggccg actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg 11880 

gcgacgtgga gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc 11940 

ccacagatga tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc 12000 

gcgactactg acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga 12060 

tgaggggcgc acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc 12120 

aagggtttcc gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca 12180 

atatttataa accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg 12240 

aaggggggtg cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc 12300 
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ccagcggctg 
ccttqccatl 
cggaagcatt 
tgagggcggc 
cttcatggcg 
cgtgctcgtg 
ataccgaggt 
ttaaaaagct 
attgacaat a 
tttcaggggg 
taaaaacttg 
ctatcataat 
gatgactttg 
tgccaggtgc 
gattacgtgc 
tatcaccacg 
ttcaccgaat 
gcgctggcgc 
gacgtcactg 
cgtaaaatcg 
ttcatggcca 



cgcccctcgg 
gccgcgatcg 
gacgtgccgc 
ggcctgggtg 
gggccggcaa 
ttcgggggtg 
atgaaaacga 
accaagacga 
ctgataagat 
caaggcatag 
catggactaa 
tgggtaatga 
tcatgcagct 
tgcctcagat 
agctttccct 
tcaaagggtg 
acgtgcgcaa 
gatttagccc 
cccggctgta 
tgttgaggcc 
tatcaatgat 



ccgcgaacgg 
gggcagtaac 
aggtgctggc 
gcggcctgcc 
tttttacctt 
cgataaaccc 
gaattggacc 
agaggatgaa 
aatatatctt 
gcagcgcgct 
tgcttgaaac 
ctccaactta 
ccaccgattt 
tcaggttatg 
tcaggcggga 
acagcaggct 
caaccgt ctt 
cgacatagcc 
tgcgcgaggt 
aacgcccata 
tttctggtgc 



cctcacccca 

gggatgggcg 

atcgacattc 

cttcacttcg 

gggcattctt 

agcgaaccat 

tttacagaat 

gaggatgagg 

ttatatagaa 

tatcaatata 

ccaggacaat 

ttgatagtgt 

tgagaacgac 

ccgctcaatt 

ttcatacagc 

cataagacgc 

ccggagactg 

ccactgttcg 

taccgactgc 

atgcgggctg 

gtaccgggtt 



aaaatggcac 

atcagcccga 

agcgaccagg 

gccgccgggg 

ggcatagtgg 

ttgaggtgat 

tactctatga 

aggcagattg 

gatatcgccg 

tctatagaat 

aaccttatag 

tttatgttca 

agcgacttcc 

cgctgcgtat 

ggccagccat 

cccagcgtcg 

tcatacgcgt 

tccatttccg 

ggcctgagtt 

ttgcccggca 

gagaagcggt 



cgctgccact 

gcgcga cgcc 

tgccggccag 

cattcacgga 

tcgcgggtgc 

aggtaagatt 

agcgccatat 

ccttgaatat 

tatgtaagga 

gggcaaagca 

cttgt aaatt 

gataatgccc 

gtcccagccg 

atcgcttgct 

ccgtcatcca 

ccatagtgcg 

aaaacagcca 

cgcagacgat 

ttttaagtga 

tccaacgcca 

gtaagtgaac 



12360 

12420 

12430 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 
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tgcagttgcc atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcgg-tgcct 13620 

ttgccgttac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac 13800 

aactttgaaa aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaatt-g 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 

ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc 14 160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag 14340 

acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 

aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga 14460 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagct attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt 14700 

atattttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 

caggagcgca ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc 14820 
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aagtatttgg gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac 1 4 SSO 

gagaaggacg gccagacggt ctacgggacc gacttcattg ccgatsaggt ggattatctg 14340 

gacaccaagg caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc 15000 

ggggcaatcc cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa 15060 

gaactgatcg acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc 15120 

atgcgtgcgc cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc 15180 

aagatcgagc gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc 15240 

gtggagcgtt cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc 15300 

gacacgcgag gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa 15360 

caggtcagcg aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa 15420 

atgcagcttt - ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac 15480 

gacacggccc gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg 15540 

caaaacaagg tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag 15600 

ctgcgggccg acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc 15660 

cctatcggcg agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg 15720 

atcaatggcc ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg 15780 

atgggcttca cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc 15840 

cgcgtcctgg accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc 15900 

gtcgtgctgt ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg 15960 

tcgccgacgg cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc 16020 

aagctggaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc 16080 
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gagcaggtcg gcgaagcctg 



cgaagagttg 



cgaggcagcg gcctggtgga acacgcctgg 



16140 



gtcaatgatc acctggtgca 



ttgcaaacgc 



tagggccttg tggggtcagt tccggctggg 



16200 



ggttcagcag ccagcgcttt 



actggcattt 



caggaacaag cgggcactgc tcgacgcact 



16260 



tgcttcgctc agtatcgctc 



gggacgcacg 



gcgcgctcta cgaactgccg ataaacagag 



16320 



gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 

aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggccccaa ggacgctcac aaggcgcacc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 

tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 

cgctgttggt gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg 16980 

cgggggcggt ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggctcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc 17280 

cggggatgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag 17340 
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caatggatag gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttccicagcg gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca 17460 

gcctgtcacg gtcaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgataut tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17530 

ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 

aaatcaaaag aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

ccactacgtg aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 

aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 18240 
gcgagaaagg aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg 18300 
ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 
tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 
ggccagtgaa ttcgagctcg gtacccggg 18449 



<210> 50 
<211> 18617 
<212> DNA 
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<2I3> Artificial 
<220> 

<223> Piasrnid 
<220> 

<22i> misc_f eature 

<222> (10264 ) . . (10264 ) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 50 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 
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cttctcgaac cctcccggcc ccctaacgcg ggcctcccai ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 
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tctrccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 
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gcctcatgtg cggatcggat tccacccgcg tgaagaagrg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttacrggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 
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cgacact egg aacttegggr ccccgacctg taccattegg tgagcaatgg ataggggagt 5700 

tgatatcgtc aaccttcact tctaaagaaa tagcgccact eager recce agegacttta 5760 

tecagegatt tcctattatg teggcatagt tctcaagatc gaeagectgt cacggttaag 5S20 

cgagaaatga araagaaggc tgataattcg gatctctgeg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgeta ccctccgcga gatcatccgt 5940 

gtttcaaacc eggcagctta gttgccgttc ttccgaatag categgtaac atgagcaaag 6000 

tctgccgcct tacaaegget ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agetgeeggt eggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgaegctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggega 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggega aaaacegtet ateagggega tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt geegtaaage actaaategg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa ageeggegaa cgtggcgaga aaggaaggga 6600 

agaaagegaa aggageggge gecattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaaege cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa ataegtcgag cctgctccgc ttggaagcgg 6840 

egaggagect cgtcctgtca caactaccaa catggagtac gataagggee agttccgcca 6900 
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gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tctacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgcrcgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 
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acagctctgc cgcctctgac gtgcagtgga tgattattaa tcccggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaartggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt ■ 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 90.60 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 
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ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 94S0 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtc.tg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 
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atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttct ttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgctgtcg aagctgcagt caatcagcgt caaggcccgc cgcgttgaac tagcccgcga 114 60 

catcacgcgg cccaaagtct gcctgcatgc tcagcggtgc tcgttagttc ggctgcgagt 11520 

ggcagcacca cagacagagg aggcgctggg aaccgtgcag gctgccggcg cgggcgatga 11580 

gcacagcgcc gatgtagcac tccagcagct tgaccgggct atcgcagagc gtcgtgcccg 11640 

gcgcaaacgg gagcagctgt cataccaggc tgccgccatt gcagcatcaa ttggcgtgtc 11700 

aggcattgcc atcttcgcca cctacctgag atttgccatg cacatgaccg tgggcggcgc 11760 

agtgccatgg ggtgaagtgg ctggcactct cctcttggtg gttggtggcg cgctcggcat 11820 

ggagatgtat gcccgctatg cacacaaagc catctggcat gagtcgcctc tgggctggct 11880 

gctgcacaag agccaccaca cacctcgcac tggacccttt gaagccaacg acttgtttgc 11940 
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aatcatcaat ggactgcccg ccatgctcct ctgtaccttt cgcttctgcc tqcccaacgt 12000 

cctggccgcg cccLgctttg gagcggggct gggcatcacg ctataccgca tcccaratat 12060 

gtttgtacac gatggcctgg tgcacaggcg ctttcccacc gggcccatcg ctggcctgcc 12120 

ctacatgaag cgcctgacag tggcccacca gctacaccac agcggcaagt acggtggcgc 12130 

gccctggggt argttcttgg gtccacagga gctgcagcac attccaggtg cggcggagga 12240 

ggrggagcga ctggtcctgg aactggactg gtccaagcgg tagaagcttg agattaaaat 12300 

agataaggaa aagaaagtga aaagaaattc ggaagcatgg cacattcttc tttttataaa 12360 

tacatgcctg actttctttt tccatcgata tgatatatgc atatgataga tatacaagca 12420 

atcttcttca aggagtttga aattttgtcc tccaggagca aaaaaaagtt tttttttata 12480 

catgtttgta cacaagaata gttaccaatt tgctttggtc ttacgtgctg caagtttata 12540 

tcgttttcaa tttctttgtc tttacatttt ctttgtcctt tatctttcct catttagtct 12600 

ttgggagaat taggaaaagg gagcggaaag gtaagaaatg cttgcgtatt ttactaattc 12660 

ggcaaacatc caatttggca aacagcagcc tgtgcaacgc tctcgagatg acagtatctt' 12720 

tgattacact ctaaatctcg atgacccgac caaaaagagc gaacaaagaa ataatcttgt 12780 

gcattcgaat atgatggaag attttttccc ccttattcta aatgttgaca tagcgtgtat 12840 

gttatataaa caaaaagaaa ttgtacaaac tttcttttct tctcttttta ttttatctct 12900 

atgatccagt tagaacaacc actcagtcat caagcaaaac tgactccagt actgagaagt 12960 

aaatctcagt ttaaggggct tttcattgct attgtcattg ttagcgcatg ggtcattagc 13020 

ctgagtttat tactttccct tgacatctca aagctaaaat tttggatgtt attgcctgtt 13080 

atactatggc aaacattttt atatacggga ttatttatta catctcatga tgccatgcat 13140 

ggcgtagtat ttccccaaaa caccaagatt aatcatttga ttggaacatt gaccctatcc 13200 
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ctttcuggtc ttttaccata tcaaaaacta ttgaaaaaac attggttaca ccaccacaat 13260 

ccagcaagci caatagaccc ggattttcac aatggtaaac accaaagttt ccttgctrgg 13320 

tattttcatt ttatgaaagg ttactggagt tgggggcaaa taattgcgtt gactattatt 13380 

tataactttg ctaaatacat actccatatc ccaagtgata atctaactta cttttgggtg 13440 

ctaccctcgc ttttaagttc attacaatta ttctattttg gtactttttt accccatagt 13500- 

gaaccaatag ggggttatgt tcagcctcat tgtgcccaaa caattagccg tcctatttgg 13560 

tggtcattta tcacgtgcta tcattttggc taccacgagg aacatcacga atatcctcat 13620 

atttcttggt ggcagttacc agaaatttac aaagcaaaat agaagcttgg cgtaatcatg 13680 

gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc 13740 

cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc 13800 

gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat 13860 

cggccaacgc gcggggagag gcggtttgcg tattgggcca aagacaaaag ggcgacattc 13920 

aaccgattga gggagggaag gtaaatattg acggaaatta ttcattaaag gtgaattatc 13980 

accgtcaccg acttgagcca tttgggaatt agagccagca aaatcaccag tagcaccatt 14040 

accattagca aggccggaaa cgtcaccaat gaaaccatcg atagcagcac cgtaatcagt 14100 

agcgacagaa tcaagtttgc ctttagcgtc agactgtagc gcgttttcat cggcattttc 14160 

ggtcatagcc cccttattag cgtttgccat cttttcataa tcaaaatcac cggaaccaga 14220 

gccaccaccg gaaccgcctc cctcagagcc gccaccctca gaaccgccac cctcagagcc 14280 

accaccctca gagccgccac cagaaccacc accagagccg ccgccagcat tgacaggagg 14340 

cccgatctag taacatagat gacaccgcgc gcgataattt atcctagttt gcgcgctata 14400 

ttttgttttc tatcgcgtat taaatgtata attgcgggac tctaatcata aaaacccatc 14460 
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tcataaataa cgtcatgcat tacatgttaa ttattacatg ctraacgtaa ttcaacagaa 14520 

attatatgat aatcatcgca agaccggcaa caggattcaa tcttaagaaa ctttattgcc 145S0 

aaatgtttga acgatcgggg atcatccggg tctgtggcgg gaactccacg aaaatatccg 14640 

aacgcagcaa gatatcgcgg tgcatctcgg tcttgcctgg gcagtcgccg ccgacgccgt 14700 

tgatgtggac gccgggcccg atcatattgt cgctcaggat cgtggcgttg rgcttgtcgg 14760 

ccgttgctgt cgtaatgata tcggcacctt cgaccgcctg ttccgcagag atcccgtggg 14820 

cgaagaactc cagcatgaga tccccgcgct ggaggatcat ccagccggcg tcccggaaaa 14880 

cgattccgaa gcccaacctt tcatagaagg cggcggtgga atcgaaatct cgtgatggca 14940 

ggttgggcgt cgcttggtcg gtcatttcga accccagagt cccgctcaga agaactcgtc 15000 

aagaaggcga tagaaggcga tgcgctgcga atcgggagcg gcgataccgt aaagcacgag 15060 

gaagcggtca gcccattcgc cgccaagctc ttcagcaata tcacgggtag ccaacgctat 15120 

gtcctgatag cggtccgcca cacccagccg gccacagtcg atgaatccag aaaagcggcc 15180 

attttccacc atgatattcg gcaagcaggc atcgccatgg gtcacgacga gatcatcgcc 15240 

gtcgggcatg cgcgccttga gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc 15300 

ttcgtccaga tcatcctgat cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat 15360 

gcgatgtttc gcttggtggt cgaatgggca ggtagccgga tcaagcgtat gcagccgccg 15420 

cattgcatca gccatgatgg atactttctc ggcaggagca aggtgagatg acaggagatc 15480 

ctgccccggc acttcgccca atagcagcca gtcccttccc gcttcagtga caacgtcgag 15540 

cacagctgcg caaggaacgc ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg 15600 

cagttcattc agggcaccgg acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc 15660 

tgacagccgg aacacggcgg catcagagca gccgattgtc tgttgtgccc agtcatagcc 15720 
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gaatagcctc tccacccaag cggccggaga acctgcctgc aatccatctt grtcaatcat 15780 

ccgaaacca: ccagatccgg tgcagattat ttggattgag agtgaatatg agactctaat 1554 0 

tggataccga ggggaattta tggaacgtca gtggagcatt tttgacaaga aatatttgcn 15900 

agctgatagt gaccttaggc gacttttgaa cgcgcaataa tggtttctga cgtatgtgct 15960 

tagctcatta aactccagaa acccgcggct gagtggctcc ttcaacgttg cggttctgtc 16020 

agttccaaac gtaaaacggc ttgtcccgcg tcatcggcgg gggtcataac gtgactccct 16080 

taattctccg ctcatgatca gattgtcgtt tcccgccttc agtttaaact atcagtgttt 16140 

gacaggatat attggcgggt aaacctaaga gaaaagagcg tttattagaa taatcggata 16200 

tttaaaaggg cgtgaaaagg tttatccytt cgtccatttg tatgtgcatg ccaaccacag 16260 

ggttccccag atctggcgcc ggccagcgag acgagcaaga ttggccgccg cccgaaacga 16320 

tccgacagcg cgcccagcac aggtgcgcag gcaaattgca ccaacgcata cagcgccagc 16380 ■ 

agaatgccat agtgggcggt gacgtcgttc gagtgaacca gatcgcgcag gaggcccggc 16440 

agcaccggca taatcaggcc gatgccgaca gcgtcgagcg cgacagtgct cagaattacg 16500 

atcaggggta tgttgggttt cacgtctggc ctccggacca gcctccgctg gtccgattga 16560 

acgcgcggat tctttatcac tgataagttg gtggacatat tatgtttatc agtgataaag 16620 

tgtcaagcat gacaaagttg cagccgaata cagtgatccg tgccgccctg gacctgttga 16680 

acgaggtcgg cgtagacggt ctgacgacac gcaaactggc ggaacggttg ggggttcagc 16740 

agccggcgct ttactggcac ttcaggaaca agcgggcgct gctcgacgca ctggccgaag 16800 

ccatgctggc ggagaatcat acgcattcgg tgccgagagc cgacgacgac tggcgctcat 16860 

ttctgatcgg gaatgcccgc agcttcaggc aggcgctgct cgcctaccgc gatggcgcgc 16920 

gcatccatgc cggcacgcga ccgggcgcac cgcagatgga aacggccgac gcgcagcttc 16980 
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gcttcctctg cgacgccggt ttttcggccg gggacgccgt caatgcgctc atgacaaica 17040 

gctactrcac tgrtggggcc gtgcttgagg agcaggccgg cgacagcgat gccggcgagc I7I00 

gcggcggcac cgttgaacag gctccgctct cgccgctgtt gcgggccgcg atagacccct 17160 

tcgacgaagc cggtccggac gcagcgttcg agcagggact cgcggtgatt gtcgatggat 17220 

tggcgaaaag gaggctcgtt gtcaggaacg ttgaaggacc gagaaagggt gacgattgat 17280 

caggaccgct gccggagcgc aacccactca ctacagcaga gccatgtaga caacatcccc 17340 

tccccctttc caccgcgtca gacgcccgta gcagcccgct acgggctttt tcatgccctg 17400 

ccctagcgtc caagcctcac ggccgcgctc ggcctctctg gcggccttct ggcgctcttc 17460 

cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 17520 

tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 17580 

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 17640 

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 17700 

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 17760 

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 17820 

ggcgcttttc cgctgcataa ccctgcttcg gggtcattat agcgattttt tcggtatatc 17880 

catccttttt cgcacgatat acaggatttt gccaaagggt tcgtgtagac tttccttggt 17940 

gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg cccacccgcg agcgggtgtt 18000 

ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa cgggaatcct gctctgcgag 18060 

gctggccggc taccgccggc gtaacagatg agggcaagcg gatggctgat gaaaccaagc 18120 

caaccaggaa gggcagccca cctatcaagg tgtactgcct tccagacgaa cgaagagcga 18180 

ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc ctacctgctg gccgtcggcc 18240 



BASF NAE 878/03 
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ecccctacaa aatcacgggc gtcgtggact atgagcacgt ccccgacctg gcccgcatca 18300 

atggccacct gggccgcctg ggcggcctgc tgaaactctg gcicacccac cacccccgca 18360 

cggcgcggrt cggtgatgcc acgatcctcg ccctgctggc gaagatcgaa gagaagcagg 18420 

acgagcttgg caaggtcatg atgggcgtgg tccgcccgag ggcagagcca tgactttttt 18480 

agccgctaaa acggccgggg ggtgcgcgtg att'gccaagc acgtccccat gcgctccatc 18540 

aagaagagcg acttcgcgga gctggtgaag tacatcaccg acgagcaagg caagaccgag 18600 

cgcctttgcg acgctca 18617 

<210> 51 

<211> 18333 

<212> DNA 

<213> Artificial 

<220> 

<223> Piasmid 
<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 



<400> 51 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 
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aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga taccicccgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cactrgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgangrgga 300 

caagcctggg garaagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 
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atgactccaa cttattgata gtgttttarg ttcagataai gcccgatgac tttgtcatgc 1380 

agciccaccg atttrgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgatrac gigcagcLii 1500 

cccttcaggc gggattcata cagcggccag ccatccgica tccatatcac cacgtcaaac 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa' ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcacn ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 
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tgaagatgaa caaagccctg aaaagattat cgagctgtat ccggagtgca rcaggctctr 2640 

tcactccatc cacatatcgg attgtcccta tacgaatagc ttagacagcc ccttacccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact cggaagaaga 2760 

cactccantr aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2S40 

cttctgcgrc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 
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ccctgttcac cacccgcaac aagaaaatcc cgcgcgaggc gctgcaaaac eager: cat tt 3900 

tccacgtcaa caacgacgic aagatcacct acaccggcgt egagctgegg gecgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt aegegaageg cacccctatc ggcgacccga 4020 

tcaccttcac gttccacgag etttgecagg acctgggctg gtcgatcaat ggceggratt 4080 

acacgaaggc egaggaatge ctgtcgcgcc tacaggegae ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac- gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcacccgg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg eggateggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

ectgegaaga gttgegagge ageggectgg tggaacaege ctgggtcaat gatgacctgg 4500 

tgcattgcaa aegctaggge cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagegggea ctgctcgacg cacttgcttc gctcagtatc 4620 

getegggacg cacggcgcgc tctacgaact gecgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gaeggcttgg agcggccgac gtgeaggatt tecgegagat 4740 

ccgattgtcg gecctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagece atggaggcgt tegctgaacg gttgegagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggaeggee ccaaggacgc 4920 

teacaaggeg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtege 4980 

eggtatgetg ctgcgggcgt tgccggcggg tttattgetc gtgatgatcg tccgacagat 5040 

tecaaeggga atctggtgga tgegcatett catcctcggc gcacttaata tttegctatt 5100 
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crggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggcacg 5160 

cgctgrgcag ccgcrgatgg tcgtgttcat ctctgcccci ctgctaggta gcccgatacg 5220 

attgatggcg grcctggggg ctatttgcgg aactgcgggc gtggcgctgr nggrgtit gac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgrtcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6.000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 



BASF NAE 878/03 31 1/357 

ccgagatagg gttgagrgtt g^rccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgu caaagggcga aaaaccgtct atcagggcga tggcccacra cgtgaaccsi 6480 

cacccaaatic aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaacrgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 
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tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagrcacac aacacaagc: 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg - 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 
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tttgaacctt tcagttcgag ctttcccact tcatcgcacc ttcactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagr ttctgatcga 9000 

aaagrtcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccganggtrt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattc.gg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 
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gtcccttcta gtgatttaat agctccatgt caacaacaat aaaacgcctt ttcgcgttta 10200 

cctcitccac aracagctca tctgcaatgc attaatgcat tgactgcaac cuagtaaccc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ccgaggaatc cgctcrtggc 10360 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttacrctg 10440 

atagctigac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgrt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta a.ccatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 
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tccagtactg aceagtaaai ctcagtttaa ggggcttttc attgctattg tcattgttag 12720 

cgcatgggrc artaccctca gtttattact ttcccttgac atctcaaagc taaaattttc 127 80 

gatgctattg cctgrtatac tatggcaaac atttttatat acgggattat ttattacatc 12S40 

tcatgatgcc atgcarggcg tagtatttcc ccaaaacacc aagattaatc atttgattgg 12900 

aacatrgacc ctatcccttt atggtctttt accatatcaa aaactattga aaaaacattg 12960 

gttacaccac cacaatccag caagctcaat agacccggat tttcacaatg gtaaacacca 13020 

aagtttcttr gcttggtatt ttcattttat gaaaggttac tggagttggg ggcaaataat 13080 

tgcgttgact attatttata actttgctaa atacatactc catatcccaa gtgataatct 13140 

aacttacttt tgggtgctac cctcgctttt aagttcatta caattattct attttggtac 13200 

ttttttaccc catagtgaac caataggggg ttatgttcag cctcattgtg cccaaacaat 13260 

tagccgtcct atttggtggt catttatcac gtgctatcat tttggctacc acgaggaaca 13320 

tcacgaatat cctcatattt cttggtggca gttaccagaa atttacaaag caaaatagaa 13380 

gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc 13440 

cacacaacat acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct 13500 

aactcacatt - aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 13560 

agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggccaaaga 13620 

caaaagggcg acattcaacc gattgaggga gggaaggtaa atattgacgg aaattattca 13680 

ttaaaggtga attatcaccg tcaccgactt gagccatttg ggaattagag ccagcaaaat 13740 

caccagtagc accattacca ttagcaaggc cggaaacgtc accaatgaaa ccatcgatag 13800 

cagcaccgta atcagtagcg acagaatcaa gtttgccttt agcgtcagac tgtagcgcgt 13860 

tttcatcggc attttcggtc atagccccct tattagcgtt tgccatcttt tcataatcaa 13920 
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aa tcaccgga 
cgccaccctc 
cagcattgac 
tagtttgcgc 
atcataaaaa 
acgtaattca 
aagaaacttt 
tccacgaaaa 
t cgccgccga 
gcgttgtgct 
gcagagatcc 
ccggcgtccc 
aaatctcgtg 
ctcagaagaa 
taccgtaaag 
gggtagccaa 
atccagaaaa 
cgacgagatc 
cgagcccctg 
tacgtgctcg 
gcgtatgcag 



accagagcca 
agagccacca 
aggaggcccg 
get atatttt 
cccatctcat 
acagaaatta 
attgecaaat 
tatccgaacg 
cgccgttgat 
tgtcggccgt 
cgtgggcgaa 
ggaaaacgat 
atggcaggtt 
ctegtcaaga 
cacgaggaag 
cgctatgtcc 
gcggccattt 
atcgccgtcg 
atgetctteg 
etcgatgega 
ccgccgcatt 



ccaccggaac 
ccctcagagc 
atctagtaac 
gttttctatc 
aaataacgtc 
tatgataatc 
gtttgaacga 
cagcaagata 
gtggacgccg 
tgctgtcgta 
gaactccagc 
tccgaagccc 
gggegtcget 
aggegataga 
cggtcagccc 
tgatagcggt 
tccaccatga 
ggcatgcgcg 
tccagatcat 
tgtttcgctt 
gcatcagcca 



cgcctccctc 
cgccaccaga 
atagatgaca 
gegtat laaa 
atgeattaca 
ategcaagae 
teggggatea 
tegeggtgea 
ggcccgatca 
atgatategg 
atgagatccc 
aacctttcat 
tggteggtea 
aggegatgeg 
attcgccgcc 
ccgccacacc 
tatteggcaa 
ccttgagcct 
cctgatcgac 
ggtggtcgaa 
tgatggatac 



agagccacca 

accaccacca 

ccgcgcgcga 

tgtataattg 

tgttaattat 

eggcaacagg 

teegggtctg 

teteggtett 

tattgtcget 

caccttcgac 

cgcgctggag 

agaaggegge 

tttcgaaccc 

ctgegaateg 

aagctcttca 

cagccggcca 

gcaggcatcg 

ggcgaacagt 

aagacegget 

tgggcaggta 

tttcteggea 



ccct cagaac 

gagccgccgc 

iaat itatcc 

cgggactcia 

tacatgetta 

attcaatctt 

tggegggaac 

gcctgggcag 

caggategtg 

cgcctgttcc 

gatcatccag 

ggtggaatcg 

cagagtcccg 

ggageggega 

gcaatatcac 

cagtcgatga 

ccatgggtca 

tcggctggcg 

tccatccgag 

geeggatcaa 

ggagcaaggt 



13 930 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 
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cagatgacac gagatcctgc cccggcactt cgcccaatag cacccactcc cttcccgcr: 15240 

cagt:gacaac gtcgagcaca gctgcgcaag gaacgcccgt cgtggccagc caccaiaccc 15300 

gcgctgcctc gtcctgcagt tcattcaggg caccggacag gtcggtcttg acaaaaagaa 15360 

ccgggcgccc ctgcgctgac agccggaaca cggcggcatc agagcagccg attgtctgtt 15420 

gtgcccagtc atagccgaat agcctctcca cccaagcggc cggagaacct gcgtgcaatc 15480 

catcttgttc aatcatgcga aacgatccag atccggtgca gattatttgg attgagagtg 15540 

aatatgagac tctaattgga taccgagggg aatttatgga acgtcagtgg agcatttttg 15600 

acaagaaata tttgctagct gatagtgacc ttaggcgact tttgaacgcg caataatggt 15660 

ttctgacgta tgtgcttagc tcattaaact ccagaaaccc gcggctgagt ggctccttca 15720 

acgttgcggt tctgtcagtt ccaaacgtaa aacggcttgt cccgcgtcat cggcgggggt 15780 

cataacgtga ctcccttaat tctccgctca tgatcagatt gtcgtttccc gcctt cagt t ■ 15840 

taaactatca gtgtttgaca ggatatattg gcgggtaaac ctaagagaaa agagcgttta 15900 

ttagaataat cggatattta aaagggcgtg aaaaggttta tccgttcgtc catttgtatg 15960 

tgcatgccaa ccacagggtt ccccagatct ggcgccggcc agcgagacga gcaagattgg 16020 

ccgccgcccg aaacgatccg acagcgcgcc cagcacaggt gcgcaggcaa attgcaccaa 16080 

cgcatacagc gccagcagaa tgccatagtg ggcggtgacg tcgttcgagt gaaccagatc 16140 

gcgcaggagg cccggcagca ccggcataat caggccgatg ccgacagcgt cgagcgcgac 16200 

agtgctcaga attacgatca ggggtatgtt gggtttcacg tctggcctcc ggaccagcct 16260 

ccgctggtcc gattgaacgc gcggattctt tatcactgat aagttggtgg acatattatg 16320 

tttatcagtg ataaagtgtc aagcatgaca aagttgcagc cgaatacagt gatccgtgcc 16380 

gccctggacc tgttgaacga ggtcggcgta gacg.gtctga cgacacgcaa actggcggaa 16440 
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cggttggggg ttcagcagcc ggcgctttac tggcacttca cgaacaaccg ggccctgctc 16500 

gacgcactgg ccgaagccat gctggcggag aatcatacgc attcggtgcc gagagccgac 16560 

gacgactggc gctcatttct gatcgggaat gcccgcagct tcaggcaggc gctgcicgcc 16620 

taccgcgatg gcgcgcgcat ccatgccggc acgcgaccgg gcgcaccgca gatggaaacg 16680 

gccgacgcgc agcttcgctt cctctgcgag gcgggttttt cggccgggga cgccgtcaat 16740 

gcgctgatga caatcagcta cttcactgtt ggggccgtgc ttgaggagca ggccggcgac 16800 

agcgatgccg " gcgagcgcgg cggcaccgtt gaacaggctc cgctctcgcc gctgttgcgg 16860 

gccgcgatag acgccttcga cgaagccggt ccggacgcag cgttcgagca gggactcgcg • 16920 

gtgattgtcg atggattggc gaaaaggagg ctcgttgtca ggaacgttga aggaccgaga 16980 

aagggtgacg attgatcagg accgctgccg gagcgcaacc cactcactac agcagagcca 17040 

tgtagacaac atcccctccc cctttccacc gcgtcagacg cccgtagcag cccgctacgg 17100 

gctttttcat gccctgccct agcgtccaag cctcacggcc gcgctcggcc tctctggcgg 17160 

ccttctggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 17220 

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 17280 

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 17340 

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 17400 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 174 60 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 17520 

tcccttcggg aagcgtggcg cttttccgct gcataaccct gcttcggggt cattatagcg 17580 

attttttcgg tatatccatc ctttttcgca cgatatacag gattttgcca aagggttcgt 17640 

g.tagactttc cttggtgtat ccaacggcgt cagccgggca ggataggtga agtaggccca 17700 
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ccccccagcg ggt gt rcctt cttcactgtc ccttattcgc acctggcggt gcrcaacggg 17760 

aatcctgctc tgcgaggctg gccggctacc gccggcgtaa cagatgacgg caagcggatg 17320 

gctgatgaaa ccaagccaac caggaagggc agcccaccta tcaaggtgta ctgccttcca 17880 

gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg gcatgagcct gtcggcctac 17940 

ctgctggccg rcggccaggg ctacaaaatc acgggcgtcg tggactatga gcacgtccgc 18000 

gagctggccc gcatcaatgg cgacctgggc cgcctgggcg gcctgctgaa actctggctc 18060 

accgacgacc cgcgcacggc gcggttcggt gatgccacga tcctcgccct gctggcgaag 18120 

atcgaagaga agcaggacga gcttggcaag gtcatgatgg gcgtggtccg cccgagggca 18180 

gagccatgac ttttttagcc gctaaaacgg ccggggggtg cgcgtgattg ccaagcacgt 18240 

ccccatgcgc tccatcaaga agagcgactt cgcggagctg gtgaagtaca tcaccgacga 18300 

gcaaggcaag accgagcgcc tttgcgacgc tea 18333 



<210> 52 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<221> misc_f eature 

<222> (3) . . (3) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (9) . . (9) 

<223> n is a, c, g, or t 
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<400> 52 

gcngarggna rhtggta 

<210> 53 

<21I> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<220> 

<221> misc_f eature 

<222> (3) . . (3) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (6) . . (6) 

<223> n is a, c, g, or t 

<400> 53 

tcngcnagra adatrttrtg 

<210> 54 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 54 

aagtgacacc ggttacacgc ttgtctt 



<210> 
<211> 
<212> 



55 
27 
DNA 
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<213> Artificial 
<220> 

<223> Primer 

<400> 55 

gcttatcacc atctgttacc tccttgc 

<2i0> 56 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 56 

agagagggat ccttaaatgc gaatatcgtt gc 

<210> 57 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 57 

agagagggat ccatgtctga tcaaaagaag ca 

<210> 58 

<211> 37 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 58 
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actttattgg atccrcaaat gcgaatatcg ttgctgc 

<210> 59 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 59 

gttccaattg gccacatgaa gagtaagaca ggaaacag 

<210> 60 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 60 

cctgtcttac tcttcatgtg' gccaattgga accaacac 

<210> 61 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 61 

ctattttaat catatgtctg atcaaaagaa gcatattg 



<210> 
<211> 
<212> 



62 

16103 
DNA 
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<2I3> Artificial 



<220> 

<223> Primer 



<220> 

<22 1 > misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (3770) . . (3770) 

<223> n is a, c, q, or t 

<400> 62 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 
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gcettggaat gaacatgaat ctgaggactc caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctaitcatca "7 20 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta fgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt S60 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 
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ggcagctttg ccccctctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat zzqzzgccaz attttcctgc tctccccacc agczgctczz ttcttttctc I960 

tttcttttcc catcrtcagt atattcatct tcccatccaa gaacctttat trcccctaag 2040 

taagtacttu gctacatcca tactccatcc ttcccatccc ttattccttt gaacct ntca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 2460 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700. 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgrcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 
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gtagaactac tcgccgatag tggaaaccqa cgccccagca ctcctccgag ggcaaaggaa 31S0 

tagagtagar gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccitctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

Gtcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttgag tctatcgcct ccaaaaagta 4020 

cggtgctgaa ttcagatatc aatcgcctgt tgctaaaatt aacactgtcg ataaagacaa 4080 

gcgtgtaacc ggtgtcactt tggaaagcgg agaagtcatt gaagccgatg cagtcgtatg 4140 

taatgcggat cttgtttatg cttatcacca tctgttacct ccttgcaatt ggacaaagaa 4200 

gacattagcc tcaaagaaac tcacttcatc atctatttcg ttttattggt ccatgtcaac 4260 

aaaggtgcct caattagacg tacacaatat cttcttggct gaagcctaca aggaaagttt 4320 

tgatgagatt ttcaacgact tcggtttgcc ctctgaagct tggcgtaatc atggtcatag 4380 
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ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc 4 4 4 0 

ataaactgta aagcctgggg tgcctaatga gtgagctaac tcacattaan tgcgtcgcgc 4500 

tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg astcggccaa 4560 

cgcgcgggga gaggcggttt gcgtattggg ccaaagacaa aagggcgaca ttcaaccgat 4620 

tgagggaggg aaggtaaata ttgacggaaa ttattcatta aaggtgaatt atcaccgtca 4680 

ccgacttgag ccatttggga attagagcca gcaaaatcac cagtagcacc attaccatta 4740 

gcaaggccgg aaacgtcacc aatgaaacca tcgatagcag caccgtaatc agtagcgaca 4800 

gaatcaagtt tgcctttagc gtcagactgt agcgcgtttt catcggcatt ttcggtcata 4860 

gcccccttat tagcgtttgc catcttttca taatcaaaat caccggaacc agagccacca 4 920 

ccggaaccgc ctccctcaga gccgccaccc tcagaaccgc caccctcaga gccaccaccc 4980 

tcagagccgc caccagaacc accaccagag ccgccgccag cattgacagg aggcccgatc 5040 

tagtaacata gatgacaccg cgcgcgataa tttatcctag tttgcgcgct atattttgtt 5100 

ttctatcgcg tattaaatgt ataattgcgg gactctaatc ataaaaaccc atctcataaa 5160 

taacgtcatg cattacatgt taattattac atgcttaacg taattcaaca gaaattatat 5220 

gataatcatc gcaagaccgg caacaggatt caatcttaag aaactttatt gccaaatgtt 5280 

tgaacgatcg gggatcatcc gggtctgtgg cgggaactcc acgaaaatat ccgaacgcag 5340 

caagatatcg cggtgcatct cggtcttgcc tgggcagtcg ccgccgacgc cgttgatgtg 5400 

gacgccgggc ccgatcatat tgtcgctcag gatcgtggcg ttgtgcttgt cggccgttgc 5460 
tgtcgtaatg atatcggcac cttcgaccgc ctgttccgca gagatcccgt gggcgaagaa 5520 

ctccagcatg agatccccgc gctggaggat catccagccg gcgtcccgga aaacgattcc 5580 

gaagcccaac ctttcataga aggcggcggt ggaatcgaaa tctcgtgatg gcaggttggg 5640 
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cgtcgcttgg tcggtcattt ccaaccccag agtcccgctc agaagaactc ctcaagaagg 5700 

cgatacaagg cgargcgctg cgaatcggga gcggcqatac cgtaaagcac gacgaagcgg 5760 

tcagcccatt cgccgccaag ctcttcagca atatcacggg tagccaacgc latctcciga 5820 

tagcggtccg ccacacccag ccggccacag tcgatgaatc cagaaaagcg gccattitcc 58S0 

accatgatat tcggcaagca ggcatcgcca tgggtcacga cgagatcatc gccgtcgggc 5940 

atgcgcgcct tgagcctggc gaacagttcg gctggcgcga gcccctgatg ctcttcgtcc 6000 

agatcatcct gatcgacaag accggcttcc atccgagtac gtgctcgctc gatgcgatgt 6060 

ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg tatgcagccg ccgcattgca 6120 

tcagccatga tggatacttt ctcggcagga gcaaggtgag atgacaggag atcctgcccc 6180 

ggcacttcgc ccaatagcag ccagtccctt cccgcttcag tgacaacgtc gagcacagct 6240 

gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg ctgcctcgtc ctgcagttca 6300 

ttcagggcac cggacaggtc ggtcttgaca aaaagaaccg ggcgcccctg cgctgacagc 6360 

cggaacacgg cggcatcaga gcagccgatt gtctgttgtg cccagtcata gccgaatagc 6420 

ctctccaccc aagcggccgg agaacctgcg tgcaatccat cttgttcaat catgcgaaac 6480 

gatccagatc cggtgcagat tatttggatt gagagtgaat atgagactct aattggatac 6540 

cgaggggaat ttatggaacg tcagtggagc atttttgaca agaaatattt gctagctgat 6600 

agtgacctta ggcgactttt gaacgcgcaa taatggtttc tgacgtatgt gcttagctca 6660 

ttaaactcca gaaacccgcg gctgagtggc tccttcaacg ttgcggttct gtcagttcca 6720 

aacgtaaaac ggcttgtccc gcgtcatcgg cgggggtcat aacgtgactc ccttaattct 6780 

ccgctcatga tcagattgtc gtttcccgcc ttcagtttaa actatcagtg tttgacagga 6840 

tatattggcg ggtaaaccta agagaaaaga gcgtttatta gaataatcgg atatttaaaa 6900 
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gcgcctgaaa agctttatcc gttcgtccat tt gtatc tec atgccaacca cagcgttccc 6S60 

cagatciggc gccggccagc gagacgagca agattggccg ccgcccgaaa cgatcccaca 7020 

gcgcgcccag cacaggtgcg caggcaaatt gcaccaacgc atacagcgcc ageagaatge 7080 

catagtgggc ggtgacgtcg ttcgagtgaa ccagatcgcg caggaggccc ggcagcaccg 7140 

gcataatcag gccgatgccg acagegtega gcgcgacagt gctcagaatt acgatcaggg 7200 

gtatgttggg tttcaegtet ggcctccgga ccagcctccg ctggtccgat tgaacgcgcg 7260 

gattctttat cactgataag ttggtggaca tattatgttt atcagtgata aagtgtcaag 7320 

catgacaaag ttgeagcega atacagtgat ccgtgccgcc ctggacctgt tgaacgaggt 7380 

eggegtagae ggtctgacga cacgcaaact ggeggaaegg ttgggggttc agcagccggc 7440 

gctttactgg cacttcagga acaageggge gctgctcgac gcactggccg aagecatget 7500 

ggcggagaat cataegcatt eggtgecgag agccgacgac gaetggeget catttctgat ■ 7560 

egggaatgee cgcagcttca ggcaggeget gctcgcctac cgcgatggcg cgcgcatcca 7620 

tgccggcacg cgaccgggcg caccgcagat ggaaaeggee gacgcgcagc ttcgcttcct 7680 

ctgcgaggcg ggtttttcgg ccggggacgc cgtcaatgcg ctgatgacaa tcagctactt 7740 

cactgttggg gccgtgcttg aggagcaggc cggcgacagc gatgeeggeg agegeggegg 7800 

caccgttgaa caggctccgc tctcgccgct gttgegggee gcgatagacg ccttcgacga 7860 

agccggtccg gaegcagegt tcgagcaggg actcgeggtg attgtcgatg gattggcgaa 7920 

aaggaggctc gttgtcagga aegttgaagg accgagaaag ggtgacgatt gatcaggacc 7980 

getgeeggag cgcaacccac tcactacagc agagecatgt agacaacatc ccctccccct 8040 

ttccaccgcg tcagacgccc gtagcagccc getaeggget ttttcatgee ctgccctagc 8100 

gtccaagcct cacggccgcg ctcggcctct ctggcggcct tctggcgctc ttccgcttcc 8160 
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tcgctcactg actcgctgcg ctcggtcgtt cgcctgcggc gagcggtatc agctcactca 8220 

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 82S0 

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 8340 

ctccgccccc c'tgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 8400 

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 84 60 

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 8520 

ttccgctgca taaccctgct tcggggtcat tatagcgatt ttttcggtat atccatcctt 8580 

tttcgcacga tatacaggat tttgccaaag ggttcgtgta gactttcctt ggtgtatcca 8640 

acggcgtcag ccgggcagga taggtgaagt aggcccaccc gcgagcgggt gttccttctt 8700 

cactgtccct tattcgcacc tggcggtgct caacgggaat cctgctctgc gaggctggcc 8760 

ggctaccgcc ggcgtaacag atgagggcaa gcggatggct gatgaaacca agccaaccag 8820 

gaagggcagc ccacct'atca aggtgtactg ccttccagac gaacgaagag cgattgagga 8880 

aaaggcggcg gcggccggca tgagcctgtc ggcctacctg ctggccgtcg gccagggcta 8940 

caaaatcacg ggcgtcgtgg actatgagca cgtccgcgag ctggcccgca tcaatggcga 9000 

cctgggccgc ctgggcggcc tgctgaaact ctggctcacc gacgacccgc gcacggcgcg 9060 

gttcggtgat gccacgatcc tcgccctgct ggcgaagatc gaagagaagc aggacgagct 9120 

tggcaaggtc atgatgggcg tggtccgccc gagggcagag ccatgacttt tttagccgct 9180 

aaaacggccg gggggtgcgc gtgattgcca agcacgtccc catgcgctcc atcaagaaga 9240 
gcgacttcgc ggagctggtg aagtacatca ccgacgagca aggcaagacc gagcgccttt 9300 
gcgacgctca ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca 9360 
aacgcgccag aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga 9420 
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tacctcgccg aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg 9480 

gccgactcac ccggcgcggc gttgacagat gaggggcagg cccgarttcg gccgccgacg 9540 

tggagctggc cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag 9600 

atgargtgga caagcctggg gataagtgcc ctgcggtatt gacactrgag gggcgcgacr 9660 

"actgacagat gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg 9720 

gcgcacctat tgacattrga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt 9780 

ttccgcccgt ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt 9840 

ataaaccttg tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg 9900 

ggtgcccccc cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg 9960 

gctgcgcccc tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc 10020 

cattgccggg- atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag 10080 

cattgacgtg ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg 10140 

cggcggcctg ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat 10200 

ggcggggccg gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct 10260 

cgtgttcggg ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg 10320 

aggtatgaaa acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa 10380 

agctaccaag acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac 10440 

aatactgata agataatata tcttttatat agaagaratc gccgtatgta aggatttcag 10500 

ggggcaaggc ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa 10560 

cttgcatgga ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca 10620 

taattgggta atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac 10680 
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rttgtcatgc agctccaccg attt tgacaa cgacagcgac ttccgtccca gccgtgccag 10740 

gtgctgcctc agartcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac I0S00 

gtgcagcttt cccttcaggc gggattcata cagcggccag ccatccgtca tccanatcac 10860 

cacgtcaaag ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc 10920 

gaatacgtgc gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg 10980 

gcgcgattta gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc 11040 

actgcccggc tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa 11100 

atcgtgttga ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg 11160 

gccatatcaa tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt 11220 

tgccatgttt tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg 11280 

ttacgcacca ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact 11340 

ggagcacctc aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat 11400 

tgtggtttca aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt 11460 

gaaaaagctg ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc 11520 

gtcttgttat aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat 11580 

aataaatggc taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct 11640 

gcgtaaaaga tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg 11700 

aaaacctata tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac 11760 

gggaaaagga catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact 11820 

ttgaacggca tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct 11880 

cggaagagta tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca 11940 
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tcaggcrctt 
get tagccga 
gggaagaaga 
ageccgaaga 
aagatggcaa 
atgacattgc 
agctattttt 
tactggatga 
cgcaccgact 
ttgggcaagg 
gaeggecaga 
aaggcaccag 
atcccgcaag 
ategacgegg 
gcgccccgcg 
gagegegaca 
cgttcgcgtc 
cgaggaacta 
agegaggeca 
ctttccttgt 
gcccgctctg 
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tcactccatc 

caccccattt 
ggaactitgtc 
agt aagtggc 
cttctgcgtc 
tgacttactg 
attgttttag 
tcttccgcat 
ggtcgctggt 
eggtctaegg 
gcgggtcaaa 
gagggtgaat 
ggttttccgc 
aaaccttcca 
gcgtgcaact 
gtctcgaaca 
tgacgaccaa 
agcaggccgc 
tegatattge 
ccctgttcac 



gaeatategg 
ttactgaata 
aaagatcege 
ttttcccacg 
tttattgatc 
eggtcgatea 
gggatcaagc 
tacctagatg 
caagtgtttt 
attcgtgcag 
gaccgacttc 
tcaggaataa 
gaatcggacg 
egaggatgee 
gtccgtcggc 
ggctccccct 
ggaggeggea 
gaagcgaaaa 
gttgctgaaa 
gccgtggccg 
cacgcgcaac 



attgtcccta 
acgatctggc 
gcgagctgta 
gcgacctggg 
ttgggagaag 
gggaggatat 
ctgattggga 
tggcgcaacg 
ggctctcagg 
ggcaagattc 
attgecgata 
gggcacattg 
tttgaccgga 
gaaaccatcg 
tcgatggtcc 
gccctgcccg 
ggtttggcga 
accgccggcg 
cacacgaagc 
gaeacgatge 
aagaaaatcc 



t acgaa i_agc 

cga t ct ggat 

tgatttttta 

agacagcaac 

eggcagggeg 

eggggaagaa 

gaaaataaaa 

atgeeggega 

ccgaggccca 

ggaataccaa 

aggtggatta 

ccccggcgtg 

aggcatacag 

caagccgcac 

agcaagctac 

cgccatcggc 

agtcgatgac 

aggacctggc 

agcagatcaa 

gagegatgee 

cgcgcgaggc 



ttacacagcc 

tgegaaaact 

aagaccgaaa 

atctt tgtga 

gacaagtggt 

cagtatgtcg 

tattatattt 

caagcaggag 

eggcaagtat 

gtacgagaag 

tctggacacc 

agteggggea 

gcaagaactg 

cgtcatgcgt 

ggecaagate 

cgccgtggag 

catcgacacg 

aaaacaggtc 

ggaaatgcag 

aaacgacacg 

getgeaaaac 



12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 
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aaggtcattt tccacgtcaa caaggacctc aegatcacct acacccgcct cgagctgcgg 13260 

gccgacgatg acgaacrggt gtggcagcag gtgttggact acccgaagcg cacccctatc 13320 

ggcgagccga tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat 13380 

ggccggratt acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc 13440 

ttcacgtccg accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc 13500 

ctggaccgtg gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg 13560 

ctgtttgctg gcgaccacta cacgaaattc atatgggaga agraccgcaa gctgtcgccg 13620 

acggcccgac ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg 13680 

gaaaccttcc gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag 13740 

gtcggcgaag cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat 13800 

gatgacctgg tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca 13860 

gcagccagcg ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc 13920 

gctcagtatc gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa 13980 

aattgacaat tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt 14040 

tccgcgagat ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg 14100 

agcacgagga gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat 14160 

tcggcgccta catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc 14220 

ccaaggacgc tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc 14280 

gaggggtcgc cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg 14340 

tccgacagat tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata 14400 

tttcgctatt ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg 144 60 
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cgacggtagg cgctgtgcag cccctgatgg tcgtgttcat ctctgccgct ctgctaggta 14520 

gcccgatacg atrgatggcg gtcctggggg ctatttgcgg aactgccggc gtggcgctgt 14580 

tgg'tgttgac accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg 14640 

ccgtttccat ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgccrctgc 14700 

tcacctttac cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag 14760 

tgtttgatcc gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg 14820 

gcctgatcgg agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac 14880 

ctacagttgt ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga 14940 

tgcatcaggc cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg 15000 

ataggggagt tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc 15060 

agcggcttta tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt 15120 

cacggttaag cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga 15180 

tatttgatca caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga 15240 

gatcatccgt gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac 15300 

atgagcaaag tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg 15360 

ctgcctgtat cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct 15420 

ggtggcagga tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg 15480 

cggacgtttt taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg 15540 

attgcccttc accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtt tgccc 15600 

cagcaggcga aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca 15660 

aaagaatagc ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta 15720 
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aagaacgtgg actccaacct caaagggcga aaaaccg^ct atcagggcca tcgcccacia 15780 

cgtgaaccat cacccaaatc aagttttttg gggtcgaggt gccctaaagc actaaatccg 15840 

aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga 15900 

aaggaaggga agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg 15960 

gcgatcggtg cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag 16020 

gcgattaagt tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag 16080 

tgaattcgag ctcggtaccc ggg 16103 

<210> 63 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 63 

ggcgtacttg aaggaaccct taccg 25 

<210> 64 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 64 

attgatgctc ccggtcaccg tgatt 25 

<210> 65 
<211> 500 
<212> DNA 
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<213> Blakeslea trispora 
<400> 65 

aatctataca atgcrccata gactcacatt gatattgtcg aagaittcga tgcicactta 60 

gtagagcaac tacaaaagtt agcagagaag catgatttct taatctttga agaccgcaag 120 

tttgcagata tcggtatgtg aattctatct attttttttc tgatgtgtgc atggatgact 180 

ca'tgatcata ttcttaggta atactgtcaa gcatcaatat ggcaagggcg tttacaagat 240 

tgcttcttgg tctcatatta ctaatgctca cacagttcct ggagaaggta ttatcaaggg 300 

acttgccgaa gtcggcctcc ctcttggtcg tggcttgctt ttgctagcag aaatgtcatc 360 

tcaaggtgca ttaactaagg gtatttacac tgccgaatct gtcaatatgg ctcgccgcaa 420 

caaagatttc gtttttggct ttattgcaca acacaaaatg aatcagtatg atgatgagga 480 

ttttgttgtc atgtcgcctg 500 

<210> 66 
<211> 611 
<212> DNA 

<213> Blakeslea trispora 
<400> 66 

gagattaaaa tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt • 60 

ctttttataa atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag 120 

atatacaagc aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt 180 

ttttttttat acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct 240 

gcaagtttat atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc 300 

tcatttagtc tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat 360 

tttactaatt cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat 420 
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gacagratct ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga 480 

aataatcttg tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgrtgac 540 

atagcgtgta tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt 600 

attttatctc t 611 

<210> 67 
<211> 720 
<212> DNA 

<213> Blakeslea trispora 
<400> 67 

atgtcaatac tcacttatct ggaatttcat ctctactata cactacctgt ccttgcggca 60 

ttgtgttggc tgctaaagcc gtttcactca cagcaagaca atct-caagta taaattttta 120 

atgttgatgg ccgcctctac cgcatcgatt tgggacaatt atatcgttta tcatcgcgct. 180 

tggtggtact gtcctacttg tgttgtggct gtcattggct atgtacctct agaagaatac 240 

atgttcttta tcatcatgac tttaatgact gtcgcgttct caaactttgt tatgcgttgg 300 

cacttgcata ctttctttat ta-gacccaac acttcttgga agcaaacact attagtacgc 360 

cttgtgcctg tttcagcttt attggcaatc acttatcatg cttggcactt gacactgcca 420 

aataaacctt cattttatgg ttcatgcatc ctttggtatg cttgtcctgt gttggctatt 480 

ctttggctgg gtgctggcga atatatcttg cgtcgacctg tggctgtcct tttgtctatt 540 

gttatcccta gtgtatacct atgttgggct gatatcgtcg ctattagtgc tggcacatgg 600 

catatttctc ttagaacaag cactggcaaa atggtagtac ccgatttacc tgtagaagaa 660 

tgcctgtttt ttactttgat caacacagtc ttggtttttg ctacctgtgc tatagaccgc 720 



<210> 68 
<211> 1089 
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<212> DNA 

<2I3> Biakesiea trispora 
<400> 68 

ctgtacaaat catctgttca aaatcaaaac cctaaacaag ccatttccct tttccagcat . 60 

grcaaagagc tagcatgggc cttctgtctt cctgaccaaa tgctcaacaa tgaattgttt 120 

gatgatctta ctatcagctg ggatatttta cgtaaagcct caaagtcatt ctatactgca 180 

tctgccgttt ttccaagtta tgtacgtcaa gacttgggtg ttctctatgc tttct cgcaga 240 

gctaccgatg acctgtgcga tgatgaatcc aaatctgttc aagaaagaag agaccaatta 300 

gatcttactc gacaatttgt tcgtgatctc tttagccaaa agaccagtgc gcctattgtg 360 

attgattggg aattgtatca aaaccaactt cctgcttctt gtatatcagc cttta<gagcc 420 

tttactcgcc ttcgccatgt ccttgaagta gaccctgtag aagaactatt agatggttac 480 

aaatgggatc ttgagcgtcg tcctatcctt gatgaacaag acttggaggc atactctgct 540 

tgtgtggcca gtagtgtggg tgaaatgtgc acacgtgtga ttcttgctca agaccaaaag 600 

gaaaatgatg cttggataat tgaccgtgca cgtgagatgg ggctggtgct acaatacgtt 660 

aacattgctc gagacattgt gactgatagc gagactctgg gtcgatgtta tctgcctcaa 720 

caatggctta gaaaagaaga aacagaacaa atacagcaag gcaacgcccg tagcctaggt 780 

gatcaaagac tgttgggctt gtctctgaag cttgtaggaa aggcagacgc tatcatggtg 840 

agagctaaga agggcattga caagttgccg gcaaactgtc aaggcggtgt acgagctgct 900 

tgccaagtat atgctgcaat tggatctgta ctcaagcagc agaagacaac atatcctaca 960 

agagctcatc taaaaggaag cgaacgtgcc aagattgctc tgttgagtgt atacaacctc 1020 

tatcaatctg aagacaagcc tgtggctctc cgtcaagcta gaaagattaa gagttttttt 1080 

gttgattag 1089 
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<210> 69 
<211> 611 
<212> DNA 

<213> Blakeslea trispora 
<400> 69 

agagataaaa taaaaagaga agaaaagaaa gtttgtacaa tttctttttg tttatataac 60 

atacacgcta tgtcaacatt tagaataagg gggaaaaaat cttccatcat attcgaatgc 120 

acaagattat ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca 180 

aagatactgt catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc 240 

gaattagtaa aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa 300 

agactaaatg aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga 360 

tataaacttg cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg. 420 

tataaaaaaa aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat 480 

tgcttgtata tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta 540 

tttataaaaa gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttcct tatct 600 

attttaatct c 611 

<210> 70 
<211> 882 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 70 

atgctgtcga agctgcagtc aatcagcgtc aaggcccgcc gcgttgaact agcccgcgac 60 
atcacgcggc ccaaagtctg cctgcatgct cagcggtgct cgttagttcg gctgcgagtg 120 
gcagcaccac agacagagga ggcgctggga accgtgcagg ctgccggcgc gggcgatgag 180 
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cacagcgccg atctagcact ccagcagctt caccgggcta tcgcagagcc tcgtgcccgg 240 

cgcaaacggg agcagctgtc ataccaggct gccgccattg cagcatcaat tgqcgtgtca 300 

ggcattgcca tcttcgccac ctacctgaga tttgccatgc acatgaccgt gggcggcgca 360 

gtgccatggg gtgaagtggc tggcactctc ctcttggtgg ttggtggcgc gctcggcatg 420 

gagatgtatg cccgctatgc acacaaagcc atctggcatg agtcgcctct gggctggctg 480 

ctgcacaaga gccaccacac acctcgcact ggaccctttg aagccaacga cttgtttgca 540 

atcatcaatg gactgcccgc catgctcctg tgtacctttg gcttctggct gcccaacgtc 600 

ctgggggcgg cctgctttgg agcggggctg ggcatcacgc tatacggcat ggcatatatg 660 

tttgtacacg atggcctggt gcacaggcgc tttcccaccg ggcccatcgc tggcctgccc 720 

tacatgaagc gcctgacagt ggcccaccag ctacaccaca gcggcaagta cggtggcgcg 780 

ccctggggta tgttcttggg tccacaggag ctgcagcaca ttccaggtgc ggcggaggag 840 

gtggagcgac tggtcctgga actggactgg tccaagcggt ag 882 

<210> 71 
<211> 528 
<212> DNA 

<213> Erwinia uredovora 
<400> 71 

atgttgtgga tttggaatgc cctgatcgtt ttcgttaccg tgattggcat ggaagtgatt 60 

gctgcactgg cacacaaata catcatgcac ggctggggtt ggggatggca tctttcacat 120 

catgaaccgc gtaaaggtgc gtttgaagtt aacgatcttt atgccgtggt ttttgctgca 180 

ttatcgatcc tgctgattta tctgggcagt acaggaatgt ggccgctcca gtggattggc 240 

gcaggtatga cggcgtatgg attactctat tttatggtgc acgacgggct ggtgcatcaa 300 

cgttggccat tccgctatat tccacgcaag ggctacctca aacggttgta tatggcgcac 360 
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cgtatgcatc acgccgtcag gggcaaagaa ggttgtgttt cttttccctt cctctatgcg 420 

ccccccctgt caaaacttca ggcgacgctc cgggaaagac atggcgctag agcgggcgct 480 

gccagagatg cgcagggcgg ggaggatgag cccgcatccg cgaagtaa 52S 

<210> 72 
<211> 7 62 
<212> DNA 

<213> Nostoc sp. PCC73102 
<400> 72 

atgatccagt tagaacaacc actcagtcat caagcaaaac tgactccagt actgagaagt 60 

aaatctcagt ttaaggggct tttcattgct attgtcattg ttagcgcatg ggtcattagc 120 

ctgagtttat tactttccct tgacatctca aagctaaaat tttggatgtt attgcctgtt 180 

atactatggc aaacattttt atatacggga ttatttatta catctcatga tgccatgcat 240 

ggcgtagtat ttccccaaaa caccaagatt aatcatttga ttggaacatt gaccctatcc 300 

ctttatggtc ttttaccata tcaaaaacta ttgaaaaaac attggttaca ccaccacaat 360 

ccagcaagct caatagaccc ggattttcac aatggtaaac accaaagttt ctttgcttgg 420 

tattttcatt ttatgaaagg ttactggagt tgggggcaaa taattgcgtt gactattatt 480 

tataactttg ctaaatacat actccatatc ccaagtgata atctaactta cttttgggtg 540 

ctaccctcgc ttttaagttc attacaatta ttctattttg gtactttttt accccatagt 600 

gaaccaatag ggggttatgt tcagcctcat tgtgcccaaa caattagccg tcctatttgg 660 

tggtcattta tcacgtgcta tcattttggc taccacgagg aacatcacga atatcctcat 720 

atttcttggt ggcagttacc agaaatttac aaagcaaaat ga 762 

<210> 73 
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<211> 617 
<212> DNA 

<2I3> Haematococcus pluvialis 
<400> 73 

tagggtgcgg aaccaggcac gctggtttca cacctcatgc ctgtgataag gtgtggcrag 60 

agcgatgcgt gtgagacggg tatgtcacgg tcgactggtc tgatggccaa tggcatcggc 120 

catgtcrggt catcacgggc tggttgcctg ggtgaaggtg atgcacatca tcatgtgcgg 180 

ttggaggggc tggcacagtg tgggctgaac tggagcagtt gtccaggctg gcgttgaatc 240 

agtgagggtt tgtgattggc ggttgtgaag caatgactcc gcccatattc tatttgtggg 300 

agctgagatg atggcatgct tgggatgtgc atggatcatg gtagtgcagc aaactatatt 360 

cacctagggc tgttggtagg atcaggtgag gccttgcaca ttgcatgatg tactcgtcat 420 

ggtgtgttgg tgagaggatg gatgtggatg gatgtgtatt ctcagacgta gaccttgact 480 

ggaggcttga tcgagagagt gggccgtatt ctttgagagg ggaggctcgt gccagaaatg 540 

gtgagtggat gactgtgacg ctgtacattg caggcaggtg agatgcactg tctcgattgt 600 

aaaatacatt cagatgc 617 

<210> 74 
<211> 1208 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 74 

attgtgactg atagcgagac tctgggtcga tgttatctgc ctcaacaatg gcttagaaaa 60 
gaagaaacag aacaaataca gcaaggcaac gcccgtagcc taggtgatca aagactgttg 120 
ggcttgtctc tgaagcttgt aggaaaggca gacgctatca tggtgagagc taagaagggc 180 
attgacaagt tgccggcaaa ctgtcaaggc ggtgtacgag ctgcttgcca agtatatgct 240 
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gcaattggat ctgtactcaa gcagcagaag acaacatatc ctacaagagc tcatctaaaa 300 

ggaagcgaac gtgccaagat tgctctgttg agtgtataca acctctatca atctgaagac 360 

aagcctgtgg ctctccgtca agctagaaag attaagagtt tttttgttga ttagtgaatt 420 

tttgttttat ttatgtctga tagttcaata aagagacaac acatacaata taaaatcatt 480 

gtctttaaat gttaatttag tagagtgtaa agcctgcatt ttttttgtac gcataaacaa 540 

tgaattcacc ccgcttctgg tttttaaata attatgtcaa actagggaaa attctttttt 600 

ttctcttcgt tctttttttg gcttgttgtg gagtcacagg cttgtcttca gattgataga 660 

ggttgtatac actcaa'caga gcaatcttgg cacgt.tcgct tccttttaga tgagctct.tg 720 

taggatatgt tgtcttctgc tgcttgagta cagatccaat tgcagcatat acttggcaag 780 

cagctcgtac accgccttga cagtttgccg gcaacttgtc aatgcccttc ttagctctca 840 

ccatgatagc gtctgccttt cctacaagct tcagagacaa gcccaacagt ctttgatcac 900 

ctaggctacg ggcgttgcct tgctgtattt gttctgtttc ttcttttcta agccattgtt 960 

gaggcagata acatcgaccc aacatcctcg agccatacta cagcataaaa ggatacgttt 1020 

tctttaacag aaatttaccc ttttgttatc agcacataca aaaaaaaaga aatttaagat 1080 

gagtaggact tccattctct caaaaatttt attcaatcca taaatgaatt atttttggac 1140 

aaaaaagaaa gattatgcct gattttctct attttttttt tttttacaac tccaccaata 1200 

ctttctag 1208 



<210> 75 

<211> 6316 

<212> DNA 

<213> Blakeslea trispora 



<220> 
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<22 I > misc_f eature 
<222> (2694) . . (2694) 
<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 
<222> (4263) . . (4263) 
<223> n is a, c, g, or t 

<400> 75 

aaggatgaag aatccaactc taataaaaat cttatggata tctttgatcg actcaaaaag 60 

gctttcaatg ctattgctat taaaaaaaaa gagagagaga gaactatgag caaaaggact 120 

ctatgccaag atggcaaaaa ggcaccagaa acccttagtt tattattgca taatccagtc 180 

gagctagtac ttctgtagct caagcttaac cgaggatctt ggaatcaact cgtctcgtca 240 

ctcttgccga tgatcctaga aatggtatct atggatgtta tactaacatt gttatctttc 300 

aaggcctcga agatgttatt gttgcggtga taaataggct gctatgtact gaagttgctc 360 

tgtaaaatga atctagttca ctgcctactc agcaaatggt tgtttctaat gtctttaaag 420 

aaagaaaaaa agatacatat agactaccct tcctttcaag actgtaatcg agaatcggcc 480 

gatggtttat tacaattaga cgctgggaat aagcaaaagg attcatcttt gtaaataaga 540 

gactggtgca tatgaaagca aggatcgtat caaggaatag ttttgatcga gcatcaccag 600 

caaatgctgc taatgttggc ttcttctttg cttcctgaga ttgaatggga tgtgcctaga 660 

gcattgctat ttttaagtgt atactttaga tttgtgtctt tagatttgtg tcattttatt 720 

tagtcaagaa agatccccct ttctctatgt atgctaagaa gaaggagcaa gaagtgtatt 780 

tacaagttgg aatgagattg aaatattgta cataataata ataaaaagaa aggtagatca 840 

aaaaaaatgt tctgcctatt gtaagaaatc gggaccaaca ggtgcttgat aaccagaagt 900 

agcttccaat tcaggtagag gctctaggga caaatacaca attatgacag gaattttctt 960 
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gttgacttga acactacaag agaaacgggt cagcacaaaa tccgaaaaaa aaaagaaacg 1020 

gaccattcat gtcttaccta tctagctctt tgtcttcaat tgcatcccat tgctcaacca 1080 

cagatacgct tcccaattga gtatattgat gaagtgttcc ctgcattttt cgcttgacta 1140 

attccactac agtcacagtc ttattaatgt tttgtccttt accagtcagg ataatatgat 1200 

ctttttgctt cttctatcaa aaaaataatt cttgttttga ataaaaaaaa caaatattta 1260 

aagaaactac tttgatgacg gtacctggaa taactcgaga cacacatcta catatgcgtt 1320 

gattttattg tggctaattc gaacctcatt ttctgctggt gggggctgtt gactttcagt 1380 

tgctgagacg tccttcttgc ttcttttat a gtcttccact atgattttaa tcaagaaagt 1440 

aagtcagtga tgattgttac aagctatata tcttgaaaaa gaacagagag gtattattat 1500 

cagatgcaac atggttttct gtatcatttt catttcagtt tctctgttca aaaaaaaaaa 1560 

gaacactttc tctttccact cctcaaattt tttctgctaa actcctcgca aaacatgtat 1620 

ttgctttaaa ctacaagttg caattgtctg atttagcaat ttcaatatgc cttttgtgaa 1680 

tccacccaaa aataaacaag tgcttgagta tacttgggtt cagttcaaaa gaaagcaagc 1740 

tttttttttt ctttcttggg aaagaaaaaa aaatattgtt gagccatcct ttaccagcag 1800 

tatgcgagct acgacatagc tggtctaaca atgactgcaa gcaatagatc gagcttagtc 1860 

tttctattgc ttcyttgttt. gatctatgtt cggccttacg ctgacctatc caatactcga 1920 

gataggcaac aagatttcga acagtaatga aataaatttc ggataacagt tgtggatgag 1980 

gaagagaaag cgacttgaac tcgagaaact ttgttgaaat gaaatccgac cttttacgtg 2040 

atcatcatgt attatcctct ttttcttttt tttcgtagtg aattacttac tgattgcgct 2100 

caagtcgcgt ctttataaag aagaaaaaaa aatattagaa ctttcaaaaa atataactga 2160 

aaataaaagt gtggctcgga gagcaaatac cacatccttt gtcttcgctt tggtaacacg 2220 
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gttaataagc cactataggt gaataatgat catttctgag aataaagcgc ggcrtgaagc 2280 

ttatatccat atcaggattc atattaggca caactcacaa ttgacgttcc agaactccca 2340 

attttttttt cctgatagcc tgtccaatta agatcaaaaa ccactgagtt ttctctatat 2400 

artttttttt ttcataattc ttaactcttc ttcctctctc tctctctctc tctctttttg 2460 

gcttgcaaaa aaaatcttta gtaataccaa agaaagcaaa ccttttcctt ttcttatttc 2520 

cttgcttgtt ttttaatttt tgatttctct atgctttaaa tacccatttc tttctttctt 2580 

ctgctattac ctatcttttc attcctctcc cccctctctc tcttggtcta taaacatcat 2640 

gaagtcctct tttaaaagtt cgcttgacat ttatgctgtt tatatacagc atcntgtgtt 2700 

ttccaagtgg ttcattcttg cttttgttct ttcgattttc ctcaacactt atctactgaa 2760 

cgcttcgaag caacagccca aagtgataat caaaaaggtt attgagcggg tagaag'tacc 2820 

aagtagagaa caacctaaat cagtcataaa gccctcctcc aagaaacact cttctcatca ■ 2880 

tcagtctgat gtcattcgcc ctcttgatga agtattgggt ttgctcggaa cacccgaggc 2940 

cttgactgat gaagagatca tctctattgt tcaagctggt aaaatggccc cctatgctct 3000 

tgaaaaggtc ttgggcgatt tagagcgcgc tgtccatatc cgtcgtgctt tgatctcccg 3060 

tgactctcgt acgaaaactt tggaagacag tatgcttccc gtgaaaaact atcattatga 3120 

taaagtcatg ggtgcttgtt gtgaaaatgt cattggttat atgcctattc cagtaggtgt 3180 

cgcaggtaag aagttcaaca agtcgcgata tttgacaagt tgctcatcat tttcgaaaca 3240 

ggtcctttgg tgattgatgg tgattctatt catattccca tggcaactac ggaaggttgt 3300 

ttagttgctt ctactgccag aggttgtaaa gcaatcaatg ctggtggtgg tgccaacaca 3360 

attgttgttg ctgatggtat gactcgaggt ccttgtgtcg aatttcctac aatcactcgc 3420 

gctgctgact gtaaacgatg gattgaacaa gagggtgaag ctatcgtgac cgaggcattc 3480 
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aattcaactt ctcgttttgc tcgtgttcgt aaattgaaag ttgctcrtgc cggtcgtcta 3540 

gtctacatcc ctttctctac cactacaggt gatgcaatgg gcatgaacat catctccaag 3600 

ggttgtgaaa aggctttaag caagattgct gagagatatc ctgatatgca gatcatttct 3660 

ctttctggta actattgtac tgacaagaaa cctgctgcta tcaactggat tgaaggacgt 3720 

ggtaaatctg ttgttgctga sgctgtcatc cctggtacgg ttgtcgaaaa ggtatrgaag 3780 

acctctgtta gtgctttggt tgagctgaac atctctaaaa acctggttgg ttctgctatg 3840 

gctggctccg tcggtggctt taacgctcat gctgctaata ttctaactgc catttacctt 3900 

gctactggtc aagatcctgc tcaaaatgta sagagttcta actgtattac tttgatgaaa 3960 

gctgtcaatg gcgaaagaga ccttcatatc tcttgtacaa tgccctgtat tgaagtaggc 4020 

accattggtg gtggtactat tttgcctcct caacaagcca tgttggattt cattggtgtg 4080 

cgtggtcctc accctaccga acctggtgcc aatgcccgwc gccttgctcg tgttatctgt 4140 

gcctctgtga tggctggtga attgtcttta tgtgcagctt tggctgctgg tcatcttgta 4200 

aaggcacaca tggctcataa tcgtaatacc actgctgctg ccgctgttgt tcctgcccct 4260 

aanggcatag ttgatgtctc tacacctcct gctacacctg cagaaaagaa tgatcctatt 4320 

cctggaagtt gtatcaagtc atagaattaa tattatatat atatcatata caaaaaaaag 4380 

aaaaaaaaaa cactacatct atttatattt ctccatgtac acacacacac acacatataa 4440 

aaactcttta ttttccaata ttttgctttt ataaataatc ttatttcatt ctaaataaac 4500 

tgtttttttt tattaatcat caaaccctgc tgagagctgt gcaatatcat ctatgttttc 4560 

atggtttaac tctggtatcg gwcgagcctc ctctgtactt gaagtttgta ggcagttttt 4620 

atttaaggct gctggtcgat catgatcatc akcaaacctg acagcatgaa gtttt gactg 4680 

atgagcaatt tcactaaggg cagaatctga actctttcgc ttcctactat tgaccatatt 4740 
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gtctttaggt ggaatgagtg aatagcgtcr tgtcatatgt aacacagaat caacaatatc 4800 

ctggrgatga aactcggcca aacatagcgc ctttctcccc caacaattat aataatcaaa 4S6C 

atgagaatga catgtacggt tttcctcgat gacaatatcc aacgtcttgt cataatcctc 4920 

tgtgcgyata ccattcatct tttggaagaa cgcacggtag ctctcacaag ctgtcctcag 49S0 

agagttccgt gccatgtttc ccaatgctcc tggcaagtcg aaatgaagtt gtcgaatctg 5040 

gcgatgtatg fctacaatgt cgcctgtttc tttcattaga tcaagcattc gtgtagccca 5100 

aatgatgtct atgttatgat tttctttcat tccagtaata actatagttt ctcggcaaat 5160 

cgaatgastg atggagtaaa ttcatcaaaa gtgcaagtaa tacatacagt gcttgaagaa • 5220 

atcttgtgta gcacgcctat attatgtaat ataggatcga ttctcgaaac tcgacataac 5280 

caccaggctt tagcaagcgt tttatttcat tcatgacaag ctattgttaa ttcytgctta 5340 

ataaaacaaa atgaaaaaaa catacccccc tcmaaactta cttcccactc ttgattggaa 5400 

aaacaggtat agacgtgacg catatgtata taatcaaaac actcatcagg atagggtaaa 54 60 

ccattgagca catcgcattg ggtgaagaaa gtattaggag gcttgatggc tgtaggatat 5520 

ataggtgcaa tatcaatacc gtaaaactca gcatttggga attctgtagc catctccaga 5580 

atccaagtac ctgtgccaca agcaacatca agcactttag gtaagggtat acattgttgt 5640 

tcttgttgtt gttgttgaca atcacttgag tctgagtttc gttttgattg ttttaatgac 5700 

aataattctt ttacaggtgc tgagaaatta ccgtcaaata gatacttgta aataaaatgc 5760 

taaaaataaa aacaatagaa aaaaaaattg acgctcattt cattactatg gaaataactg 5820 

caaaatctta ccacttgtac aagtctatct tgctcaatct catcgtttgg cagaatgtat 5880 

ttattgttgt agtattgata tcttctacca ttcatgatat aactgtcgct tctaatgctc 5940 

tgaggtgaag tacttgtagg tgaaggtgga agtgacgcaa ttttgtcaag cttaacagga 6000 
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tcctctcggc tacatgtttt ctgcatatca ggaaaatctt gtttatttga aacatcaaca 6060 

gtagatgt gg tgtgatcttt tttgaaaata tcgatgcctt cctttgaaag ccttttgaaa 6120 

ggctctttta acttttttga gtgagagcta cccatgatag cttatgaaga attaaaaaga 6180 

aaaaagcaaa aaaaattaaa aaaaaaaaaa gtagcaaaaa attctgtcgt aattatacaa 6240 

gccaatcaaa atcgaaattc atgcaaggca tagatgttca cgtggatttg atggt tgatc 6300 

cttttttttt gcaaga 6316 

<210> 76 
<211> 1170 
<212> DNA 

<213> Thermus thermophilics 
<400> 76 

atgaagcgcc tttccctgag ggaggcctgg ccctacctga aagacctcca gcaagatccc 60 

ctcgccgtcc tgctggcgtg gggccgggcc cacccccggc tcttccttcc cctgccccgc 120 

ttccccctgg ccctgatctt tgaccccgag ggggtggagg gggcgctcct cgccgagggg 180 

accaccaagg ccaccttcca gtaccgggcc ctctcccgcc tcacggggag gggcctcctc 240 

accgactggg gggaaagctg gaaggaggcg cgcaaggccc tcaaagaccc cttcctgccg 300 

aagaacgtcc gcggctaccg ggaggccatg gaggaggagg cccgggcctt cttccjgggag 360 

tggcgggggg aggagcggga cctggaccac gagatgctcg ccctctccct gcgcctcctc 420 

gggcgggccc tcttcgggaa gcccctctcc ccaagcctcg cggagcacgc ccttaaggcc 480 

ctggaccgga tcatggccca gaccaggagc cccctggccc tcctggacct ggccgccgaa 540 

gcccgcttcc ggaaggaccg gggggccctc taccgcgagg cggaagccct catcgtccac 600 

ccgcccctct cccaccttcc ccgagagcgc gccctgagcg aggccgtgac cctcctggtg 660 

gcgggccacg agacggtggc gagcgccctc acctggtcct ttctcctcct ctcccaccgc 720 
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ccggactggc agaagcgggt ggccgagagc gaggaggcgg ccctcgccgc cttccaggag 7S0 

gccctgaggc tctacccccc cgcctggatc ctcacccgga ggctggaaag gcccctcctc 840 

ctgggagagg accggctccc cccgggcacc accctggtcc tctcccccta cgtgacccag 900 

aggctccact tccccgatgg ggaggccttc cggcccgagc gcttcctgga ggaaaggggg 960 

accccttcgg ggcgctactt cccctttggc ctggggcaga ggctctgcct ggggcgggac 1020 

ttcgccctcc tcgagggccc catcgtcctc agggccttct tccgccgctt ccgcctagac 1080 

cccctcccct tcccccgggt cctcgcccag gtcaccctga ggcccgaagg cgggcttccc 1140 

gcgcggccta gggaggaggt gcgggcgtga 1170 

<210> 77 
<211> 2981 
<212> DNA 

<213> Blakeslea trispora 
<400> 77 

tctagaattc attccattcg aaaggatcaa cataaccaat ttaatgacta ctagctaatg 60 

gatacaaata tacgcacaaa aaaagaaaga attctatgat caaagagaac acagacacag 120 

agtgatacat ttaaatggtt aagttcttat gatgttaaaa tggtaacttt attattgaat 180 

taaatgcgaa tatcgttgct gctttgtact tggaaaacgt taggtaaaag ttggttaatg 240 

aaagaagcag gagttgtagt atcatctctt gggaagaaat agaaaaagag gaaagtaaca 300 

aagtaacaag caagacaata atagatccaa tggctttcgg tcttacgagt ttgttcagga 360 

gcatacttct tttggctatc ttgtaacttt cttggtaagg gattctggcc aaagctttta 420 

cagacttggt cggaagtaag cttacttcca gcaagaacga taggaacacc agtacctgga 480 

tgtgtactac aaagaaaaga gaaatgagta cgtgcgttat taaaaaaaag aaaaaaagag 540 
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ggcaaaagta ttacctagct ccgacaaaga aaagattatc ataacggttt gtggaatcct 600 

tggtactagg tctgaaccag agaacttgga acacatcatg agaaagacca agaatagaac 660 

ctctccaaag gttaaacttg ctttgccaaa cactaggatc attcacttct tcangttcaa 720 

tcaaattagc aaagttgttt actcccaaac gacgttcgat aacttccaga accatcttgc 780 

gtgcacggtt taccaactca ggataatttt cttcagcact gtttcctgtc ttactcttca 840 
tatggccaat tggaaccaac acaataatgg agtccttgtt gggaggtgcg gcagattcat 900 
caattcgaga tggaacgttg acatagaatg aagcttcaga gggcaaaccg aagtcgttga 960 

aaatctcatc aaaactttcc ttgtaggctt cagccaagaa gatattgtgt acgtctaat.t 1020 

gaggcacctt tgttgacatg gaccaataaa acgaaataga tgatgaagtg agtttctttg 1080 

aggctaatgt cttctttgtc caattgcaag gaggtaacag atggtgataa gcataaacaa 1140 

gatccgcatt acatacgact gcatcggctt caatgacttc tccgctttcc aaagtgacac 1200 

cggttacacg cttgtcttta tcgacagtgt taattttagc aacaggcgat tgatatctga 1260 

attcagcacc gtactttttg gaggcgatag actcaagctt ctgaacaacc atgttgaaac 1320 

caccacgagg ataccagata ccttcagcaa actcggtgta ttgtaacaaa ctgtaaactg 1380 

ctggagcatc ataaggcgac atactatatt ccaaaaatag aaaatagaac aatgaatatc 1440 

aaaattcctt tcacttgccc tttttcacat ttctcttttc ccacccccga ccggtctcac 1500 

tcattttttt ttcatcccac accacgcgtt gtatgtgtac ttaccccata tacattgttt 1560 

gaaaagtaaa agccatacgc attttcttgg tttggaaata tttactggct cggtcataga 1620 

tcttaccaaa caagtgcaag cgaaagattt caggcacata ctgaagacga atcaaatccc 1680 

aaatggtttc aaagttgcgc ttgatagcaa taaatgtacc ttgttcataa tggacatgtg 1740 

tttccttcat gaaatccaag aatctaccaa atccaagggg accctcaata cggtccaatt 1800 
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cgcccttcat cttggttaaa tcggaagaga gttgtacggc atcaccgtcg rcaaaaigaa I8 60 

ccttatagtt attgtcacag cgaagcaaat ccaaatgatc accaaiacgt tcatccaaat 1920 

cagcaaatgc atcttcaaaa agcttaggca tcaaatagag tgagggaccc tgatcaaagc 1980 

gatgaccatc gtgatgaatg aatgaacaac ggccaccgga aaagtcgttc ttttcaacaa 2040 

cagtaactcg aaaaccttca cgagcaagac gagcagcagt agcagttccg ccaataccgg 2100 

ca'ccaatgac aacaatatgc ttcttttgat cagacatgag attaaaatag ataaggaaaa 2160 

gaaagtgaaa agaaattcgg aagcatggca cattcttctt tttataaata catgcctgac 2220 

tttctttttc catcgatatg atatatgcat atgatagata tacaagcaat cttcttcaag 2280 

gagtttgaaa ttttgtcctc caggagcaaa aaaaagtttt tttttataca tgtttgtaca 2340 

caagaatagt taccaatttg ctttggtctt acgtgctgca agtttatatc gttttcaatt 2400 

tctttgtctt tacattttct ttgtccttt a tctttcctca tttagtcttt gggagaatta • 2460 

ggaaaaggga gcggaaaggt aagaaatgct tgcgtatttt actaattcgg caaacatcca 2520 

atttggcaaa cagcagcctg tgcaacgctc tcgagatgac agtatctttg attacactct 2580 

aaatctcgat gacccgacca aaaagagcga acaaagaaat aatcttgtgc attcgaatat 2640 

gatggaagat tttttccccc ttattctaaa tgttgacata gcgtgtatgt tatataaaca 2700 

aaaagaaatt gtacaaactt tcttttcttc tctttttatt ttatctctat gtcaatactc 2760 

acttatctgg aatttcatct ctactataca ctacctgtcc ttgcggcatt gtgttggctg 2820 

ctaaagccgt ttcactcaca gcaagacaat ctcaagtata aatttttaat gttgatggcc 2880 

gcctctaccg catcgatttg ggacaattat atcgtttatc atcgcgcttg gtggtactgt 2940 

cctacttgtg ttgtggctgt cattggctat gtacctctag a 2981 

<210> 78 
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<211> 1749 
<212> DNA 

<213> Blakesiea trispora 
<400> 78 

atgtctgatc aaaagaagca tattgttgtc attggtgccg gtattggcgg aactgctact 60 

gctgctcgtc ttgctcgtga aggttttcga gttactgttg ttgaaaagaa cgacttttcc 120 

ggtggccgtt gttcattcat tcatcacgat ggtcatcgct ttgatcaggg tccctcactc 180 

tatttgatgc ctaagctttt tgaagatgca tttgctgatt tggatgaacg tattggtgat 240 

catttggatt tgcttcgctg tgacaataac tataaggttc attttgacga cggtgatgcc 300 

gtacaactct cttccgattt aaccaagatg aagggcgaat tggaccgtat tgagggtccc 360 

cttggatttg gtagattctt ggatttcatg aaggaaacac atgtccatta tgaacaaggt 420 

acatttattg ctatcaagcg caactttgaa accatttggg atttgattcg tcttcagtat 480 

gtgcctgaaa "tctttcgctt gcacttgttt ggtaagatct atgaccgagc cagtaaatat 540 

ttccaaacca agaaaatgcg tatggctttt acttttcaaa caatgtatat gggtatgtcg 600 

ccttatgatg ctccagcagt ttacagtttg ttacaataca ccgagtttgc tgaaggtatc 660 

tggtatcctc gtggtggttt caacatggtt gttcagaagc ttgagtctat cgcctccaaa 720 

aagtacggtg ctgaattcag atatcaatcg cctgttgcta aaattaacac tgtcgataaa 780 

gacaagcgtg taaccggtgt cactttggaa agcggagaag tcattgaagc cgatgcagtc 840 

gtatgtaatg cggatcttgt ttatgcttat caccatctgt tacctccttg caattggaca 900 

aagaagacat tagcctcaaa gaaactcact tcatcatcta tttcgtttta ttggtccatg 960 

tcaacaaagg tgcctcaatt agacgtacac aatatcttct tggctgaagc ctacaaggaa 1020 

agttttgatg agattttcaa cgacttcggt ttgccctctg aagcttcatt ctatgtcaac 1080 

gttccatctc gaattgatga atctgccgca cctcccaaca aggactccat tattgtgttg 1140 
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gttccaattg gccatatgaa gagtaagaca ggaaacagtg ctgaagaaaa ttatcctgag 1200 

ttggtaaacc gtgcacgcaa gatggttctg gaagttatcg aacgtcgttt gggagtaaac 1260 

aactttgcta atttgattga acatgaagaa gtgaatgatc ctagtgtttg gcaaagcaag 1320 

tttaaccttt ggagaggttc tattcttggt ctttctcatg atgtgttcca agttctctgg 1380 

ttcagaccta gtaccaagga ttccacaaac cgttatgata atcttttctt tgtcggagct 1440 

agtacacatc caggtactgg tgttcctatc gttcttgctg gaagtaagct tacttccgac 1500 

caagtctgta aaagctttgg ccagaatccc ttaccaagaa agttacaaga tagccaaaag 1560 

aagtatgctc ctgaacaaac tcgtaagacc gaaagccatt ggatctatta ttgtcttgct 1620 

tgttactttg. ttactttcct ctttttctat ttcttcccaa gagatgatac tacaactcct 1680 

gcttctttca ttaaccaact tttacctaac gttttccaag tacaaagcag caacgatatt 1740 

cgcatttaa 1749 



<210> 79 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 79 

ccgatggcga cgacggaagg ttgtt 25 



<210> 80 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 



BASF NAE 878/03 
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<223> Primer 
<400> 80 

catgttcatg cccattgcat cacct 



